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SPLICE VARIANTS OF ONCOGENES 

FIELD OF THE INVENTION 

The present invention concerns novel nucleic acid sequences, vectors and 
host cells containing them, amino acid sequences encoded by said sequences, and 
antibodies reactive with said amino acid sequences, as well as pharmaceutical 
5 compositions comprising any of the above. The present invention further concems 
methods for screening for candidate activators or deactivators utilizing said amino 
acid sequences. The invention also concems diagnostic assays utilizing said 
sequences. 

BACKGROUND OF THE INVENTION 

10 Many genes which are involved with tumors are involved with fiinctions 

which encourage and promote growth and division of cells. Some tumor-involved 
cells are expressed only in tumor cells, while others are expressed also in normal 
cells albeit at lower levels. A specific example of tumor-involved genes are 
oncogenes, which are muted formed of proto-oncogenes. 

15 Generally, proto-oncogenes code for cellular proteins that relay signals to 

the cell's nuclei thus stimulating growth. These cellular proteins respond to 
signals from other cells and the signaling process involves several steps among 
them binding of growth and proliferation regulating factors tb the cell membrane, 
release of second messenger, and a host of other intermediates, in the cell 

20 cytoplasm, and activation in the nucleus of transcription factors which move the 
cells through their growth cycles. 

Proto-oncogenes that code for these various components in the cascade 
may mutate, thus becoming oncogenes that keep the pathways continuously 
active regardless of the extracellular signals received by the cell. This may result 

25 in over-production of grov^h factors, flooding of the cell with replication signals, 
uncontrolled stimulation of the intermediary pathways and unrestrained cell 
growth driven by elevated levels of transcription factors. 
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The activation of a proto-oncogene to express its oncogenic potential may 
occur due to point mutation, chromosome rearrangement, gene amplification (an 
increase in the number of copies of normal proto-oncogenes within a cell) and 
viral insertion resulting in the control of the expression of the proto-oncogene by 
5 a more active promoter. 

Typically, oncogenes exhibit dominant phenotype at the cellular level, i.e. 
one copy of an activated oncogene is sufficient to produce its oncogenic effect, a 
phenomena which is termed "gain of function ". There is usually a requirement to 
have more than one mutation in the proto-oncogene in order to change a normal 
10 cell line into neoplasia. The oncogene may be transmitted firom generation to 
generation when a proto-oncogene mutates in the germ line, and since as 
indicated above usually more than one mutation is required, a single mutation 
results in a dominantly inherited tumor predisposition. 

The detection of oncogene is of major importance in the detection of 
15 tumors as well as in the detection of predisposition to a specific kind of tumor, 
which may result from additional mutations on an already mutated pro-oncogene. 
Oncogenes are detected by a plurality of methods among them PGR 
amplification, hybridization, as well as detection of the oncogenic product by 
various immunoassays. The understanding of the site of activity of the oncogene 
20 is of course of a major importance in the designing of a suitable therapeutical 
model for the treatment of the cancer resulting from the activity of said oncogene. 

Alternative splicing (AS) is an important regulatory mechanism in higher 
eukaryotes (P.A. Sharp, Cell 11, 805-8152 (1994). It is thought to be one of the 
most important mechanisms for differential expression related to tissue or 
25 development stage specificity. AS influences also: protein stability, protein 
clearance as well as tissue and cellular localization As may further alter protein 
function by increasing or decreasing the functionality,and may further affect post 
translational modifications, It is known to play a major role in numerous 
biological systems, including human antibody responses, and sex determination 
30 in Drosophila, (S. Stamm, M.Q. Zhang, T.G. Marr and D.M. Helfman, Nucleic 
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Acids Research 22, 1515-1526 (1994); B. Chabot, Trends Genet 12, 472-478 
(1996); R.E. Breitbart, A. Andreadis, B. Nadal-Ginard, Annual Rev, Biochem,, 
56, 467-495 (1987); C.W. Smith, J.G. Patton, B, Nadal-Ginard, Annu, Rev, 
Genet. 27, 527-577(1989)). 

5 Until recently it was commonly believed that alternative splicing existed in 

only a small fraction of genes (about 5%). A recent observation based on 
literature survey of known genes revises this conservative estimate to as high as 
an estimate that at least 30% of human genes are alternatively spliced (M.S. 
Gelfand, I. Dubchak, I. Draluk and M. Zom, Nucleic Acids Research 27, 301-302 

10 (1999). The importance of the actual frequency of this phenomenon lies not only 
in the direct impact on the number of proteins created (100,000 human genes, for 
example, would be translated to a much higher number of proteins), but also in 
the diversity of functionality derived from the process. 

Several mechanisms at different stages may be held responsible for the 

15 complexity of higher eukaryote which include: alternative splicing at the 
transcription level, RNA editing at the post-transcriptional level, and 
post-translational modifications are the ones characterized to date. 

GLOSSARY 

20 In the following description eind claims use will be made, at times, with a 

variety of terms, and the meaning of such terms as they should be construed in 
accordance with the invention is as follows: 

^^Tumor-involved genes (TIGy^ - genes for which there is some scientific 
25 indication linking their function, expression, or change in the level of their 
expressionto tumors. This term does not signify necessarily that the genes cause the 
tumor (although in some cases this is so) but may also indicate that the genes are a 
result of the tumor process, for example, they are activated by other genes which 
are the cause of the tumor. 

30 
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''Variant nucleic acid sequence** - the sequence shown in any one of SEQ ID 
NO: 1 to SEQ ID NO: 36, sequences having at least 90% identity (see below) to 
said sequence m\d fragments (see below) of the above sequences of least 20 b.p. 
long. These sequences are sequences coding for a novel, naturally occurring, 
5 alternative splice variants of native and known genes which are tumor-involved 
genes (TIG). It should be emphasized that the novel variants of the present 
invention are naturally occurring sequences resulting from alternative splicing of 
the TIGS and not merely truncated, mutated or fragmented forms of known 
tumor-involved sequences which are artificially produced. 

10 

''Variant product - also referred at times as the ''variant protein** or "variant 
polypeptide** - is an amino acid sequence encoded by the variant nucleic acid 
sequence which is a naturally occurring mRNA sequence obtained as a result of 
altemative splicing. The amino acid sequence may be a peptide, a protein, as well 

15 as peptides or proteins having chemically modified amino acids (see below) such as 
a glycopeptide or glycoprotein. The variant products are shown in any one of SEQ 
ID NO: 37 to SEQ ID NO: 72. The term also includes homologues (see below) of 
said sequences in which one or more amino acids has been added, deleted, 
substituted (see below) or chemically modified (see below) as well as fragments 

20 (see below) of this sequence having at least 10 amino acids. 

"Nucleic acid sequence** — a sequence composed of DNA nucleotides, RNA 
nucleotides or a combination of both types and may includes natural nucleotides, 
chemically modified nucleotides and synthetic nucleotides. 

25 

"Amino acid sequence** - a sequence composed of any one of the 20 naturally 
appe£iring amino acids, amino acids which have been chemically modified (see 
below), or composed of synthetic amino acids. 
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^'Fragment of variant nucleic acid sequence" - novel short stretch of nucleic 
acid sequences of at least 20 b.p., which does not appear as a continuous stretch 
in the original -nucleic acid sequence (see below). The fragment may be a 
sequence which was previously undescribed in the context of the published RNA 

5 and which affects the amino acid sequence encoded by the known oncogene. For 
example, where the variant nucleic includes a sequence which was not included 
in the original sequence of the oncogene (for example a sequence which was an 
intron in the original sequence) the fragment may contain said additional 
sequence. The fragment may also be a region which is not an intron, which was 

10 not present in the original sequence of the TIG. For example where the variant 
lacks a non-terminal region which was present in the original sequence of the 
TIG. The two stretches of nucleotides spanning this region (upstream and 
downstream) are brought together by splicing in the variant, but are spaced from 
each by the spliced out region in the original sequence of the TIG and are thus 

15 not continuous in the original sequence. A continuous stretch of nucleic acids 
comprising said two sparing stretches of nucleotides is not present in the original 
sequence of the TIG and thus falls under the definition of fi-agment. 

^'Fragments of variant products" - novel amino acid sequences coded by the 
20 'fragment of variant nucleic acid sequence " defined above. 

''Homologues of variants" - amino acid sequences of variants in which one or 
more amino acids has been added, deleted or replaced. The addition, deletion or 
replacement should be in the regions or adjacent to regions where the variant 
25 differs from the original sequence (see below) of the TIG. 

^'Conservative substitution'^ - refers to the substitution of an amino acid in one 
class by an amino acid of the same class, where a class is defined by common 
physicochemical amino acid side chain properties and high substitution 
30 frequencies in homologous proteins found in nature, as determined, for example. 
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by a standard Dayhoff frequency exchange matrix or BLOSUM matrix. [Six 
general classes oif amino acid side chains have been categorized and include: 
Class I (Cys); Class II (Ser, Thr, Pro, Ala, Gly); Class III (Asn, Asp, Gin, Glu); 
Class IV (His, Arg, Lys); Class V (He, Leu, Val, Met); and Class VI (Phe, Tyr, 
5 Trp). For example, substitution of an Asp for another class III residue such as 
Asn, Gin, or Glu, is a conservative substitution. 

'^Non-conservative substitution " - refers to the substitution of an amino acid in 
one class with an amino acid from another class; for example, substitution of an 
10 Ala, a class II residue, with a class III residue such as Asp, Asn, Glu, or Gin. 

^'Chemically modified'' - when referring to the product of the invention, means a 
product (protein) where at least one of its amino acid resides is modified either by 
natural processes, such as processing or other post-translational modifications, or 

15 by chemical modification techniques which are well known in the art. Among 
the numerous known modifications typical, but not exclusive examples include: 
acetylation, acylation, amidation, ADP-ribosylation, glycosylation, GPI anchor 
formation, covalent attachment of a lipid or lipid derivative, methylation, 
myristlyation, pegylation, prenylation, phosphorylation, ubiqutination, or any 

20 similar process. 

"Biologically active" - refers to the variant product having some sort of 
biological activity, for example, some physiologically measurable effect on target 
cells, molecules or tissues. 

25 

"Immunologically active" defines the capability of a natural, recombinant or 
synthetic varient product, or any fragment thereof, to induce a specific immune 
response in appropriate animals or cells and to bind with specific antibodies. 
Thus, for example, an immunologically active fi-agment of variant product 
30 denotes a fragment which retains some or all of the immunological properties of 



the variant product, e.g can bind specific anti-variant product antibodies or which 
can elicit an immune response which will generate such antibodies or cause 
proliferation of specific immune cells which produce variant. 

5 Optimal alignment^^ - is defined as an alignment giving the highest percent 
identity score. Such alignment can be performed using a variety of commercially 
available sequence analysis programs, such as the local alignment program 
LALIGN using a ktup of 1 , default parameters and the default PAM. A preferred 
alignment is; the one performed using the CLUSTAL-W program from 

10 Mac Vector (TM), operated with an open gap penalty of 10.0, an extended gap 
penalty of 0.1, and a BLOSUM similarity matrix. If a gap needs to be inserted 
into a first sequence to optimally align it with a second sequence, the percent 
identity is calculated using only the residues that are paired with a corresponding 
amino acid residue (i.e., the calculation does not consider residues in the second 

15 sequences that are in the *'gap'' of the first sequence). In case of alignments of 
known gene sequences with that of the new variant, the optimal alignment 
invariably included aligning the identical parts of both sequences together, then 
keeping apart and unaligned the sections of the sequences that differ one from the 
other. 

20 

^'Having at least 90% identity - with respect to two amino acid or nucleic acid 
sequences, refers to the percentage of residues that are identical in the two 
sequences when the sequences are optimally aligned. Thus, 90% amino acid 
sequence identity means that 90% of the amino acids in two or more optimally 
25 aligned polypeptide sequences are identical, however this definition explicitly 
excludes sequences which are 100% identical with the original sequence from 
which the variant of the invention was varied. 

'^Isolated nucleic acid molecule having an variant nucleic acid sequence*^ - is a 

30 nucleic acid molecule that includes the coding variant nucleic acid sequence. Said 
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isolated nucleic acid molecule may include the variant nucleic acid sequence as 
an independent insert; may include the variant nucleic acid sequence fused to an 
additional coding sequences, encoding together a fusion protein in which the 
variant coding sequence is the dominant coding sequence (for example, the 
5 iadditional coding sequence may code for a signal peptide); the variant nucleic 
acid sequence may be in combination with non-coding sequences, e.g., introns or 
control elements, such as promoter and terminator elements or 5' and/or 3* 
untranslated regions, effective for expression of the coding sequence in a suitable 
host; or may be a vector in which the variant protein coding sequence is a 
10 heterologous. 

Expression vector'^ - refers to vectors that have the ability to incorporate and 
express heterologous DNA fragments in a foreign cell. Many prokaryotic and 
eukaryotic expression vectors are known and/or commercially available. 
15 Selection of appropriate expression vectors is within the knowledge of those 
having skill in the art. 

''Deletion'^ - is a change in either nucleotide or amino acid sequence in which 
one or more nucleotides or amino acid residues, respectively, are absent. 

20 

Insertion^* or addition'^ - is that change in a nucleotide or amino acid 
sequence which has resulted in the addition of one or more nucleotides or amino 
acid residues, respectively, as compared to the naturally occurring sequence. 

25 Substitution^^ - replacement of one or more nucleotides or amino acids by 
different nucleotides or amino acids, respectively. As regards amino acid 
sequences the substitution may be conservative or non- conservative. 

''Antibody'' - refers to IgG, IgM, IgD, IgA, or IgG antibody. The definition 
30 includes polyclonal antibodies or monoclonal antibodies. This term refers to 

^- 
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whole antibodies or fragments of the antibodies comprising the antigen-binding 
domain of the anti-variant product antibodies, e.g. antibodies without the Fc 
portion, single chain antibodies, fragments consisting of essentially only the 
variable, antigen-binding domain of the antibody, etc. 

5 

Distinguishing antibody^' - an antibody capable of binding to the variant product 
and not the original amino acid sequence of the tumor- involved gene from which 
it has been varied, or an antibody capable of binding to the original nucleic acid 
sequence and not to the variant product. 

10 

Activator - as used herein, refers to a molecule which mimics the effect of the 
natural variant product or at times even increases or prolongs the duration of the 
biological activity of said product, as compared to that induced by the variant 
product. The mechanism may be by any mechanism known to prolonging 

15 activities of biological molecules such as binding to receptors; prolonging the 
lifetime of the molecules; increasing the activity of the molecules on its target; 
increasing the affinity of molecules to its receptor; inhibiting degradation' or 
proteolysis of the molecules, or mimicking the biological activity of the variants 
on their targets, etc. Activators may be polypeptides, nucleic acids, 

20 carbohydrates, lipids, or derivatives thereof, or any other molecules which can 
bind to and activate the variant product. 

Deactivator'^ or C^Inhibitor'^) - refers to a molecule which modulates the 
activity of the variant product in an opposite manner to that of the activator, by 
25 decreasing or shortening the duration of the biological activity of the variant 
product. This may be done by any mechanism known to deactivate or inhibit 
biological molecules such as block of the receptor, block of active site, 
competition on binding site in target, enhancement of degradation, etc. 
Deactivators may be polypeptides, nucleic acids, carbohydrates, lipids, or 
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derivatives thereof, or any other molecules which bind to and modulate the 
activity of said product. 

^^Treating a disease'^ - refers to administering a therapeutic substance effective 
5 to ameliorate symptoms associated with a disease, to lessen the severity or cure 
the disease, or to prevent the disease from occurring. Typically the disease is 
cancer. 

''Detection'' — refers to a method of detection of a cancer. This term may refer to 
10 detection of a predisposition to cancer as well as for establishing the prognosis of 
the patient by determining the severity of the disease, i.e. determining in which 
stage the cancer is. 

''Probe'' - the variant nucleic acid sequence, or a sequence complementary 
15 therewith, when used to detect presence of other similar sequences in a sample. 
The detection is carried out by identification of hybridization complexes between 
the probe and the assayed sequence. The probe may be attached to a solid support 
or to a detectable label. 

20 ''Original sequence'^ - the amino acid or nucleic acid sequence of the tumor- 
involved gene (TIG) from which the variant of the invention have been varied as 
a result of alternative slicing. This sequence will also be referred to at times as 
"tumor-involved-gene " (TIG) . 

SUMMARY OF THE INVENTION 

25 The present invention is based on the finding of several novel, naturally 

occurring splice variants, which are naturally occurring sequences obtained by 
alternative splicing of known genes which expression was reported in scientific 
literature to be involved with tumors (hereinafter tumor-involved genes'* or 
"TIGS"), The above term does not signify that the gene necessarily caused the 
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tumor (although this may be so), merely that they are involved therewith (i.e. 
expressed in tumors) and this expression may be the result of other effects, for 
example, as a result of expression of other genes. The novel splice variants of the 
invention are not merely truncated forms, fragments or mutations of the known 
5 tumor-involved genes, but rather novel sequences which naturally occur within the 
body of individuals, and thus have physiological significance. 

The terai ''alternative splicing'* in the context of the present invention and 
claims refers to: intron inclusion, exon exclusion, addition or deletion of terminal 
sequences in the variant as compared to the original sequences, as well as to the 

10 possibility of ''intron retention'*. Intron retention is an intermediate stage in the 
processing of RNA tremscripts, where prior to production of fully processed mRNA 
the intron (naturally spliced in the original TIG sequence) is retained in the variant. 
These intermediately processed RNAs may have physiological significance and are 
also within the scope of the invention. 

15 The novel variant products of the invention may have the same 

physiological activity as the original tumor-involved peptide from which they have 
been varied (although perhaps at a different level); may have an opposite 
physiological activity from the activity featured by the original tumor-involved 
peptide from which they are varied; may have a completely different, unrelated 

20 activity to the activity of the original tumor-involved peptide which they are varied; 
or alternatively may have no activity at all and this may lead to various diseases or 
pathological conditions, especially cancer. Both in the case the variant has the same 
activity as well as the case it has the opposite activity as the original TIG sequence, 
it may differ from the TIG in its stability, its clearance rate and rate of degradation 

25 its tissue and cellular localization, its ligand specificity, its cellular distribution, its 
temporal expression pathway, manner for up and down regulation and in other 
biological properties not necessarily connected to activity. 

The novel variants may also serve for detection purposes, i.e. their presence 
or level may be cancer, a predisposition to cancer or the stage and aggression of the 

30 cancer disease, or altematively the ratio between the level variants and the level 
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original peptide from which they were varied, or the ratio to other variants (all 
obtained by alternative splicing from the same original sequence of the 
tumor-involved gene) may be indicative of the presence of cancer, predisposition 
to cancer or the stage and aggressiveness of the cancer disease. 

5 For example, for detectional purposes, it is possible to establish differential 

expression of various variants in various tissues. A certain variant may be 
expressed mainly in one tissue, while the original sequence (tumor-involved 
sequence) from which it has been varied, or another variant (obtained by alternative 
splicing from the same original tumor-involved sequence) may, be expressed 

10 mainly in another tissue. Understanding of the distribution of the variants in various 
tissues may be helpful in basic research, for understanding the physiological 
function of the original tumor-involved genes from which they have been varied, as 
well as help in targeting pharmaceuticals or in developing pharmaceuticals, and in 
establishing more accurate modalities of diagnosis. 

15 The study of the variants may also be helpful in distinguishing various 

stages in the life cycles of the same type of cells which may also be helpful for 
development of pharmaceuticals for various cancer stages in which cell cycles is 
non-normal. 

Thus the detection may by determination of the presence or the level of 
20 expression of the variant within a specific cell population, comparing said presence 
or level between various cell types in a tissue, between different tissues and 
between individuals. 

Thus the present invention provides by its first aspect, a novel isolated 
nucleic acid molecule comprising or consisting of any one of the coding sequence 
25 SEQ ID NO: 1 to SEQ ID NO: 36, fragments of said coding sequence having at 
least 20 nucleic acids (provided that said fragments are continuous stretches of 
nucleotides not present in the original sequence from which the variant was varied), 
or a molecule comprising a sequence having at least 90%, identity to SEQ ID 
NO: 1 to SEQ ID NO: 36, provided that the molecule is not completely identical to 
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the original sequence of the tumor-involved gene from which the variant was 
varied. 

The present invention further provides a protein or polypeptide comprising 
or consisting of an amino acid sequence encoded by any of the above nucleic acid 
5 sequences, termed herein "variant product '\ for example, an amino acid sequence 
having the sequence as depicted in any one of SEQ ID NO: 37 to SEQ ID NO: 72, 
fragments of the above amino acid sequence having a length of at least 10 amino 
acids coded by the above fragments of the nucleic acid sequences, as well as 
homologues of the above amino acid sequences in which one or more of the amino 
10 acid residues has been substituted (by conservative or non-conservative 
substitution) added, deleted, or chemically modified. 

The deletions, insertions and modifications should be in regions, or adjacent 
to regions, wherein the variant differs from the original sequence of the tumor- 
involved gene. 

15 For example, where the variant is different from the original sequence of the 

tumor- involved gene by addition of a short stretch of 10 amino acids, in the 
terminal or non- terminal portion of the peptide i.e. inclusion of an exon, the 
invention also concerns homologues of that variant where the additional short 
stretch is altered for example, it includes only 8 additional amino acids, includes 13 

20 additional amino acids, or it includes 10 additional amino acids, however some of 
them being conservative or non-conservative substitutes of the original additional 
10 amino acids of the novel variants. In all cases the changes in the homolog, as 
compared to the original tumor-involved sequence, are in the same regions where 
the variant differs from the original sequence, or in regions adjacent to said region. 

25 Another example is where the variant lacks a non-terminal region (for 

example of 20 amino acids) which is present in the original tumor-involved 
sequence (due for example to exon exclusion). The homologues may lack in the 
same region only 17 amino acids or 23 amino acids. Again the deletion is in the 
same region where the variant lacks a sequence as compared to the original 

30 tumor-involved sequence, or in a region adjacent thereto. It should be appreciated 

r 



-14- 

that once a man versed in the art's attention is directed to the importance of a 
specific region, due to the fact that this region differs in the variant as compared to 
the original sequence of the tumor-involved gene, there is no problem in derivating 
said specific region by addition to it, deleting from it, or substituting some amino 
5 acids in it. Thus homologues of variants which are derivated firom the variant by 
changes (deletion, addition, substitution) only in said region as well as in regions 
adjacent to it are also a part of the present invention. Generally, if the variant is 
distinguished from the original sequence of the tumor-involved gene by some sort 
of physiological activity, then the homolog is distinguished firom the original 
1 0 tumor-involved sequence in essentially the same manner. 

The present invention fiarther provides nucleic acid molecule comprising or 
consisting of a sequence which encodes the above amino acid sequences, 
(including the fi-agments and homologues of the amino acid sequences). Due to the 
degenerative nature of the genetic code, a plurality of alternative nucleic acid 
15 sequences, beyond those depicted in any one of SEQ ID NO: 1 to SEQ ID NO: 36, 
can code for the amino acid sequences of the invention. Those alternative nucleic 
acid sequences which code for the same amino acid sequences £is coded by the 
sequence SEQ ID NO: 1 to SEQ ID NO: 36 (i.e. SEQ ID NO: 37 to SEQ ID 
NO: 72) are also an aspect of the of the present invention. 
20 The present invention further provides expression vectors and cloning 

vectors comprising any of the above nucleic acid sequences, as well as host cells 
transfected by said vectors. 

The present invention still fiarther provides pharmaceutical compositions 
comprising, as an active ingredient, said nucleic acid molecules, said expression 
25 vectors, or said protein or polypeptide. 

These pharmaceutical compositions are suitable for the treatment of various 
cancers, which can be ameUorated or cured by raising the level of any one of the 
variant products of the invention. 

By a second aspect, the present invention provides a nucleic acid molecule 
30 comprising or consisting of a non-coding sequence which is complementary to that 
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of any one of SEQ ID NO: 1 to SEQ ID NO: 36, or complementary to a sequence 
having at least 90% identity to said sequence (with the proviso added above) or a 
fragment of said two sequences (according to the above definition of fragment). 
The complementary sequence may be a DNA sequence which hybridizes with any 

5 one of SEQ of ID NO: 1 to SEQ ID NO: 36 or hybridizes to a portion of that 
sequence having a length sufficient to inhibit the transcription of the 
complementary sequence. The complementary sequence may be a DNA sequence 
which can be transcribed into an mRNA being an antisense to the mRNA 
transcribed from any one of SEQ ID NO: 1 to SEQ ID NO: 36 or into an mRNA 

10 which is an antisense to a fragment of the mRNA transcribed from any one of SEQ 
ID NO: 1 to SEQ ID NO: 36 which has a length sufficient to hybridize with the 
mRNA transcribed from SEQ ID NO: 1 to SEQ ID NO: 36, so as to inhibit its 
translation. The complementary sequence may also be the mRNA or the fragment 
ofthe mRNA itself. 

15 The nucleic acids of the second aspect of the invention may be used for 

therapeutic or diagnostic applications for example as probes used for the detection 
ofthe variants of the invention. The presence of the variant transcript or the level of 
the variant transcript may be indicative of cancer, predisposition to cancer or the 
stage or aggressiveness of the cancer disease. In addition or alternatively, the ratio 

20 ofthe level of the transcripts of the variants of the invention may also be compared 
• to that of the transcripts of the original sequences of the oncogenes from which 
have been varied, or to the level of transcript of other variants (especially obtained 
by altemative splicing from the same original sequence), and said ratio may be 
indicative of cancer, predisposition to cancer or the stage or aggressiveness of the 

25 cancer disease 

The present invention also provides expression vectors comprising any one 
of the above defined complementary nucleic acid sequences and host cells 
transfected with said nucleic acid sequences or vectors, being complementary to 
those specified in the first aspect of the invention. 

I 
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The invention also provides anti-variant product antibow:Os, namely 
antibodies directed against the variant product which specifically bind to said 
variant product. Said antibodies are useful both for diagnostic and therapeutic 
purposes. For example said antibody may be as an active ingredient in a 
5 pharmaceutical composition as will be explained below. 

The present invention also provides pharmaceutical compositions 
comprising, as an active ingredient, the nucleic acid molecules which comprise or 
consist of said complementary sequences, or of a vector comprising said 
complementary sequences. The pharmaceutical composition thus provides 
10 pharmaceutical compositions comprising, as an active ingredient, said anti-variant 
product antibodies. 

The pharmaceutical compositions comprising said anti-variant product 
antibodies or the nucleic acid molecule comprising said complementary sequence, 
are suitable for the treatment of diseases and pathological conditions where a 

1 5 therapeutically beneficial effect may be achieved by neutralizing the variant (either 
at the transcript or product level) or decreasing the amount of the variant product or 
blocking its binding to its target, for example, by the neutralizing effect of the 
antibodies, or by the effect of the antisense mRNA in decreasing the expression 
level of the variant sequence. In particular these diseases are cancer diseases and 

20 the treatment may also be for amelioration of cancer or for prevention of cancer 
purposes. 

According to the third aspect of the invention the present invention provides 
methods for detecting the level of the transcript (mRNA) of said variant product in 
a body fluid sample, or in a specific tissue sample, for example by use of probes 

25 comprising or consisting of said coding sequences; as well as methods for detecting 
levels of expression of said product in tissue, e.g. by the use of antibodies capable 
of specifically reacting with the variant products of the invention. Detection of the 
level of the expression of the variant of the invention in particular as compared to 
that of the original tumor-involved gene sequence from which it was varied or 

30 compared to other variant sequences all varied from the same original TIG 
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sequence may be indicative of a cancer, predisposition to cancer or the stage or 
aggressiveness of the cancer disease 

The method, according to this latter aspect, for detection of a nucleic acid 
sequence which encodes the variant product in a biological sample, comprises the 
5 steps of: 

(a) providing a probe comprising at least one of the nucleic acid 
sequences defined above; 

(b) contacting the biological sample with said probe under conditions 
allowing hybridization of nucleic acid sequences thereby enabling formation of 

1 0 hybridization complexes ; 

(c) detecting hybridization complexes, wherein the presence of the 
complexes indicates the presence of nucleic acid sequence encoding the variant 
product in the biological sample. 

The method as described above is qualitative, i.e. indicates whether the 
15 transcript is present in or absent from the sample. The method can also be 
quantitative, by detennining the level of hybridization complexes and then 
calibrating said levels to determining levels of transcripts of the desired variant in 
the sample. 

Both qualitative and quantitative determination methods can be used for 
20 diagnostic, prognostic and therapy planning purposes, especially in conjunction 
with cancer diseases. In addition qualitative determination may be indicative of the 
cancer stage. 

By a preferred embodiment the probe is part of a nucleic acid chip used for 
detection purposes, i.e. the probe is a part of an array of probes each present in a 
25 known location on a solid support. 

The nucleic acid sequence used in the above method may be a DNA 
sequence an RNA sequence, etc; it may be a coding or a sequence or a sequence 
complementary thereto (for respective detection of RNA transcripts or 
coding-DNA sequences). By quantization of the level of hybridization complexes 
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and calibrating the quantified results it is possible also to detect the level of the 
transcript in the sample. 

Methods for detecting mutations in the region coding for the variant product 
are also provided, which may be methods carried-out in a binary fashion, namely 
5 merely detecting whether there is any mismatches between the normal variant 
nucleic acid sequence of the invention and the one present in the sample, or 
carried-out by specifically detecting the nature and location of the mutation. 
Detection of mutations may be of importance in the determination of predisposition 
to Ccmcer, as well as in attempts to establish the prognosis of the cancer disease. 
10 The present invention also concems a method for detecting variant product 

in a biological sample, comprising the steps of: 

(a) contacting with said biological sample the antibody of the invention, 
thereby fomiing an antibody-antigen complex; and 

(b) detecting said antibody-antigen complex 

15 wherein the presence of said antibody-antigen complex correlates with the 

presence of variant product in said biological sample. 

Many diseases are diagnosed by detecting the presence of antibodies against 
a protein characterizing the disease in the blood, serum or any other body fluid of 
the patient. The present invention also concems a method for detecting anti-variant 

20 antibody in a biological sample, comprising: 

(a) contacting said sample with the variant product of the invention, 
thereby forming an antibody-antigen complex; and 

(b) detecting said antibody-antigen complex 

wherein the presence of said antibody-antigen complex correlates with the 
25 presence of anti-variant antibody in the sample. 

As indicated above, both methods (for detection of variant product and for 
detection of the anti-variant antibody) can be quantitized to determine the level or 
the amount of the variant or antibody in the sample, alone or in comparison to the 
level of the original amino acid tumor-involved sequence firom which it was varied 
30 or compared to the level of antibodies against the original amino acid sequence. 
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and qualitative and quantitative results may be used for diagnostic, prognostic and 
therapy planning purposes. 

The invention also concerns distinguishing antibodies, i.e. antibodies 
capable of binding either to the variant product or to the original tumor-involved 
5 gene sequence from which the variant has been varied, while not binding to the 
original sequence or the variant product respectively. These distinguishing 
antibodies may be used for detection purposes. 

By yet another aspect the invention also provides a method for identifying 
candidate compounds capable of binding to the variant product and modulating its 
10 activity (being either activators or deactivators). The method includes: 

(i) providing a protein or polypeptide comprising an simino acid 
sequence substantially as depicted in any one of SEQ ID NO: 37 to 72, or a 
fragment of such a sequence; 

(ii) contacting a candidate compound with said amino acid sequence; 

15 (iii) measuring the physiological effect of said candidate compound on 

the activity of the amino acid sequences and selecting those compounds which 
show a significant effect on said physiological activity. 

The present invention also concerns compounds identified by the above 
methods described above, which compound may either be an activator of the 

20 vari2mt product or a deactivator thereof. 

BRIEF DESCRIPTION OF THE DRAWINGS 

In order to understand the invention and to see how it may be carried out in 
practice, a preferred embodiment will now be described, by way of non-limiting 
25 example only, with reference to the accompanying drawings, in which: 

Fig. 1 is a comparison between the amino acid sequence of SEQ ID NO: 37 
and the original tumor-involved sequence from which it has been varied; 

Fig. 2 is a comparison between the amino acid sequence of SEQ ID NO: 38 
and the original tumor-involved sequence firom which it has been varied; 
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Fig. 3 is a comparison between the aiTiino acid sequence of SEQ ID NO: 39 
and the original tumor-involved sequence from which it has been varied; 

Fig- 4 is a comparison between the amino acid sequence of SEQ ID NO: 40 
and the original tumor-involved sequence from which it has been varied; 
5 Fig. 5 is a comparison between the amino acid sequence of SEQ ID NO: 41 

and the original tumor-involved sequence from which it has been varied; 

Fig. 6 is a comparison between the amino acid sequence of SEQ ID NO: 42 
and the original tumor-involved sequence from which it has been varied; 

Fig. 7 is a comparison between the amino acid sequence of SEQ ID NO: 43 
10 and the original tumor-involved sequence from which it has been varied; 

Fig. 8 is a comparison between the amino acid sequence of SEQ ID NO: 44 
and the original tumor- involved sequence from which it has been varied; 

Fig. 9 is a comparison between the amino acid sequence of SEQ ID NO: 45 
and the original tumor-involved sequence from which it has been varied; 
15 Fig. 10 is a comparison between the amino acid sequence of SEQ ID 

NO: 46 and the original tumor-involved sequence from which it has been varied; 

Fig. 11 is a comparison between the amino acid sequence of SEQ ID 
NO: 47 and the original tumor-involved sequence from which it has been varied; 

Fig. 12 is a comparison between the amino acid sequence of SEQ ID 
20 NO: 48 and the original tumor-involved sequence from which it has been varied; 

Fig. 13 is a comparison between the amino acid sequence of SEQ ID 
NO: 49 and the original tumor-involved sequence from which it has been varied; 

Fig. 14 is a comparison between the amino acid sequence of SEQ ID 
NO: 50 and the original tumor-involved sequence from which it has been varied; 
25 Fig. 15 is a comparison between the amino acid sequence of SEQ ID 

NO: 51 and the original tumor-involved sequence from which it has been varied; 

Fig. 16 is a comparison between the amino acid sequence of SEQ ID 
NO: 52 and the original tumor-involved sequence from which it has been varied; 

Fig. 17 is a comparison between the amino acid sequence of SEQ ID 
30 NO: 53 and the original tumor-involved sequence from which it has been varied; 
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Fig. 18 is a comparison between the amino acid sequence of SEQ ID 
NO: 54 and the original tumor-involved sequence from which it has been varied; 

Fig. 19 is a comparison between the amino acid sequence of SEQ ID 
NO: 55 and the original tumor-involved sequence from which it has been varied; 
5 Fig. 20 is a comparison between the amino acid sequence of SEQ ID 

NO: 56 and the original tumor-involved sequence from which it has been varied; 

Fig. 21 is a comparison between the amino acid sequence of SEQ ID 
NO: 57 and the original tumor-involved sequence from which it has been varied; 

Fig. 22 is a comparison between the amino acid sequence of SEQ ID 
10 NO: 58 and the original tumor-involved sequence from which it has been varied; 

Fig. 23 is a compeirison between the amino acid sequence of SEQ ID 
NO: 59 and the original tumor-involved sequence from which it has been varied; 

Fig. 24 is a comparison between the amino acid sequence of SEQ ID 
NO: 60 and the original tumor- involved sequence from which it has been varied; 
15 Fig. 25 is a comparison between the amino acid sequence of SEQ ID 

NO: 6 1 and the original tumor-involved sequence from which it has been varied; 

Fig. 26 is a comparison between the amino acid sequence of SEQ ID 
NO: 62 £ind the original tumor-involved sequence from which it has been varied; 

Fig. 27 is a comparison between the amino acid sequence of SEQ ID 
20 NO: 63 and the original tumor-involved sequence from which it has been varied; 

Fig. 28 is a comparison between the amino acid sequence of SEQ ID 
NO: 64 and the original tumor- involved sequence from which it has been varied; 

Fig. 29 is a comparison between the amino acid sequence of SEQ ID 
NO: 65 and the original tumor-involved sequence from which it has been varied; 
25 Fig. 30 is a comparison between the amino acid sequence of SEQ ID 

NO: 66 and the original tumor- involved sequence from which it has been varied; 

Fig. 31 is a comparison between the amino acid sequence of SEQ ID 
NO: 67 and the original tumor-involved sequence from which it has been varied; 

Fig. 32 is a comparison between the amino acid sequence of SEQ ID 
30 NO: 68 and the original tumor-involved sequence from which it has been varied; 
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Fig. 33 is a comparison between the amino acid sequence of SEQ ID 
NO: 69 and the original tumor- involved sequence from which it has been varied; 

Fig. 34 is a comparison between the amino acid sequence of SEQ ID 
NO: 70 and the original tumor-involved sequence from which it has been varied; 
5 Fig. 35 is a comparison between the amino acid sequence of SEQ ID 

NO: 71 and the original tiunor-involved sequence from which it has been varied; 

Fig. 36 is a comparison between the amino acid sequence of SEQ ID 
NO: 72 and the original tumor- involved sequence from which it has been varied. 

1 0 DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
Example I: Comparison of variants with original sequences 

Original sequences of tumor-involved genes were obtained from GenBank 
Version 115. Their tumor involvement was determined by comparison between 
the original sequences and the noval variant sequences was made using the 
15 BestFit application from the GCG suite version 10.0 (January 1999), with the 
defalut values: 

Gap creation penalty (Gap Weight): 50 

Gap extension penalty (GapLength Weight): 3 

The comparison is shown in Fig. 1 to 34 which show the comparison of 
20 each of the variant products depicted in SEQ ID NO: 37 to 72 with the original 
tumor-involved sequence from which it was varied. 

The following is a list which gives the name and the description of each 
original tumor-involved sequence from which the alternative splice variant has 
been varied by alternative splicing. The description is followed by the internal 
25 reference to the novel variant (NV-... etc.) and a short comparison between the 
variant and the original tumor-involved sequence. It should be noticed that several 
splice variants may have been originated from the same parent sequence by several 
different alternative splicings. The following table summarizes the accession 
number of the original sequence, the terminology of the new variant (NV-1 to 



NV-34) and the description of the difference between the new variant and the 
original sequence. 



Table 



Accession 


New variant # 


Description of the new variant 


KU70_HUMAN 


NV-1 


The new variant has an alternative 3' 
exon of 5 aa instead of 240 amino 
acids. It is probably missing the 
PHOSPHORYLATION (BY 
NUCLEAR KINASE Nil) site and 
half of the PRO-RICH domain but 
retains the LEUCINE-ZIPPER 
domain. 


KU70_HUMAN 


NV-2 


The new variant has a deletion of 210 
aa between residues 304- 515. Lacks 
the Pro-rich domain but retains the 
LEUCINE-ZIPPER domain and 
PHOSPHORYLATION (BY 
NUCLEAR KINASE Nil) site. 


LCK_HUMAN 


NV-3 


The new variant has an alternative 3' 
exon of 45 amino acids instead of 163 
amino acids. The new variant retains 
both SH domains and most of the 
PROTEIN KINASE domain including 
two ATP BINDING sites and the 
ACTIVE SITE. It is missing the 3' 
end of the PROTEIN KINASE 
domain and lacks the 
AUTO-PHOSPHORYLATION and 
PHOSPHORYLATION sites. 


LCK_HUMAN 


NV-4 


Insertion of 58 amino acids after 
amino acid 62 (insertion does not 
result in truncation). Insertion in first 
SH2 domain. The new variant retains 
all important sites including: the 
PROTEIN KINASE DOMAIN with 
two ATP BINDING sites, an ACTIVE 
SITE and an AUTO 
PHOSPHORYLATION site. An 
additional PHOSPHORYLATION 
site. 
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OSTP_HUMAN 


NV-5 


The new variant has an alternative 3' 
exon of 12 aa instead of 134 aa. The 
new variant maintains the CELL 
All A. \^ rnvijii IN 1 i>i 1 n ana two 

GLYCOSILATION sites. 


GA45_HUMAN 


NV-6 


The new variant has an alternative 5' 
exon of 72 amino acids instead of 125 
amino acids. The new variant has a 
signal peptide and has the two 
PHOSPHORYLATION (BY CK2) 
sites. 


WN11_HUMAN 


NV-7 


The new variant has a deletion of 22 
amino acids after residue 312 
(between 312-334). The new variant 
has all five potential 
CjLYCOblLAllUN Sites. 


WN11_HUMAN 


NV-8 


The new variant has a deletion of 1 17 
amino acids after residue 116 
(between 1 16-233). The new variant is 
missing one potential 
GLYCOSILATION site (out of 5 
sites). 


KPCT_HIJMAN 


NV-9 


The new variant has an alternative 3' 
exon of 3 amino acids instead of 94 
ammo acids. The alternative region is 
in the protein kinase domain. The new 
variant maintains the two 
PHORBOL-ESTER AND DAG 
BINDING domains, the two ATP 
binding sites and the active site or the 
kinase domain. 


IRF1_HUMAN 


NV-10 


The new variant has an altemative 3' 
exon of 7 amino acids instead of 40 
amino acids. The new variant 

rriiitntciinc tV*p» T^^TA HinHinO" HfMTlJliri 
£Ila.lIlLalIlD LlIC LJl^rA. umu-liig u.willa.iii. 


FGR1_HUMAN 


NV-11 


The new variant has an altemative 3' 
exon of 14 amino acids instead of 134 
amino acids. The new variant has the 
entire extracellular domain and the 
TM, it is missing part of the 
cytoplasmic domain. The new variant 



-25- 







maintains all 3 IMMUNO- 

GLOBULIN-LIKE DOMAINS, the 
protein KINASE domain, the 
ACTIVE site, and the 2 ATP binding 
sites, but it might be missing one of 
the two PHOSPHORYLATION 
(AUTO-) sites. 


APE1_HUMAN 


NV-12 


The new variant has a gap of 22 amino 
acids between residues 146 — 169. The 
new variant maintains the active site 
and site important for substrate 
recognition. 


APE1_HUMAN 


NV-13 


The new variant has an insertion of 25 
amino acids after residue 18. The new 
variant maintains the active site and 
site important for substrate 
recognition. 


MAD3_HUMAN 


NV-14 


The new variant has an alternative 3' 
exon of 3 amino acids instead of 15 
amino acids. It retains all five ANK 
motifs and the two 
PHOSPHORYLATION sites. 


MAD3_HUMAN 


NV-15 


The new variant has a deletion of 28 
amino acids between 183 - 212. The 
deletion is in the ANK MOTIF 4. The 
new variant maintains 4 out of the five 
ANK MOTIFs and the two 
PHOSPHORYLATION sites. 


EPA4_HUMAN 


NV-16 


Deletion of 65aa after residue 832 
(832-898). Deletion in end of 
CYTOPLASMIC domain. The 3' end 
of the PROTEIN KINASE domain is 
missing, but all important sites are 
maintained, ine new variant nas two 
FYBRONECTIN TYPE III domains 
and the protein KINASE domain with 
2 ATP binding sites, an ACTIVE site 
and an auto PHOSPHORYLATION 
site. 
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ETS2_HUMAN 


NV-17 


The new variant has a deletion of 26 
aa between 87 - 114. The new variant 
maintains the DNA binding domain. 


WN5A HUMAN 
1. 


NV-18 


The new variant has an alternative 3' 
exon of 4 amino acids instead of 109. 
It is identical to the known protein 
until residue 256. Two 
GLYCOSILATION sites out of four 
are missing in the new variant. 


TY03_HUMAN 


NV-19 


The new variant has an alternative 3' 
exon of 45 amino acids instead of 216 
amino acids. The new variant is 
missing part of the PROTEESf 
KESTASE domain and its 
AUTOPHOSPHORYLATION site. 
However, it maintains all other 
necessary domains: the ACTIVE site 
and the two ATP binding sites. The 
variant retains all 6 
GLYCOSILATION sites, the 2 
IG-like domains and the 2 
FIBRONEXTIN TYPE III domains. 


CAD2_HUMAN 


NV-20 


The new variant has an alternative 3' 
exon of 10 amino acids instead of 68 
amino acids. The new variant 
maintains the extracellular domain and 
the TM domain. It is missing the end 
of the cytoplasmic domain and the 
SER-RICH domain. However, it has 
all other necessary domains including 
:5 CADHERIN REPEATS with 7 
GLYCOSILATION sites. 


MXI1_HUMAN 


NV-21 


NV_1 m85527_3 

Insertion of 24aa after residue 79. 
Most likely truncated in insertion. Has 
basic DNA binding domain, but lacks 
helix-loop-helix. 


MXI1_HUMAN 


NV-22 


NV_2 m85527_5 

Alternative 5' exon. Identical to 
known from aa 26 to the end. Has a 5' 
exon of 3 1 aa versus 25 aa of the 
known. 

Has both DNA binding domain and 
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helix loop helix. 

The alternative 5' exon bares a 
clathrin repeat. Supported by 4 ests. 


MPK3_HUMAN 


"NV-23 


O * * t J 1 -f— » "VTA J. j_ ^ AN /-\ 

Similar to known RNA at first 290 aa. 
Alternative 3' exon of 10 aa instead of 
28 The new variant maintains the 
PROTEEn kinase domain with its 
two ATP binding sites, the ACTIVE 
site and two POSPHORYLATION 
sites. 


XRC1_HUMAN 


NV-24 


First 242aa identical to known RNA. 
Alternative short 3* exon of 50 aa 
Instead of 391aa. 


XRC1_HUMAN 


NV-25 


Identical to known RNA in first 241 
aa. Alternative 3' exon of 25 aa 
instead of 392. 


XRC1_HUMAN 


NV-26 


Identical to known RNA in first 186 
aa. Alternative short 3* exon of length 
61 aa, instead of 447 aa. 


XRC1_HUMAN 


NV-27 


Identical to known RNA in first 540 
aa. Alternative 3' exon of 84 aa 
instead of 
93 aa. 


MERL_HUMAN 


NV-28 


Deletion of 29 aa firom position 333. 
The new variant retains the Band 4-1 
like domain. (Band 4.1, which links 
the spectrin-actin cytoskeleton of 
erythrocytes to the plasma 
membrane.) 


DP1_HUMAN 


NV-29 


Alternative 3' exon of 21 amino acids 
instead of 72 amino acids. The new 
variant retains the two transmembrane 
domains. 


MDR1_HUMAN 


NV-30 


Alternative exon at 3' end at 
cytoplasmic domain. 1 aa instead of 3 
of the known. Identical to known until 
aa 1277. 


MDR1_HUMAN 


NV-31 


The new variant is a truncated protein. 
It has an altemative 3' exon of 12 
amino acids instead of 713. It is 
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identical to the known protein until 
residue 567. The new variant retains 
only one out of two ATP binding sites, 
and six out of twelve TM domains. It 
has one out of three cytoplasmic 
domains and is truncated in the middle 
of the second cytoplasmic domain. 


MK08_HUMAN 


NV-32 


Identical to known until aa 205. 
Tmncated. Has additional 13 aa. 
Lacks part of the protein kinase 
domain. Retains the active site the two 
ATP binding sites and the two 
phosphorylation sites. 


MK08_HUMAN 


NV-33 


Alternative 3 exon of 14 aa instead of 
134 aa. Identical to known until 
residue 293. Lacks end of protein 
kinase domain. 

Retains the active site, the two ATP 
binding sites and the two 
phosphorylation sites. 


MK08_HUMAN 


NV-34 


Alternative 3 ' exon of 7 aa instead of 
95 aa. Identical to known until residue 
332. Has entire protein kinase domain 
including the active site, the two ATP 
binding sites and the two 
phosphorylation sites. 


MAPK12_HUMAN 


NV-35 


The new variant contains 152 
N-terminal amino acids of the original 
protein. The new variant has 
alternative 25 amino acids in its 
C-terminus, instead of original 215 
amino acids. It contains the NP_BIND 
(between the amino acids 33-41, and 
the ATP binding site at position 56. 
The truncated variant has only part of 
the kinase domain, it lacks the active 
site and both the phosphorylation sites 
that activates the kinase. . This 

CrUncaLCU bpilCC VariallL CoIl dUL uo 

dominant negative. 


KPCT_,HUMAN 


NV-36 


The new variant has an alternative 3' 
exon of 36 amino acids instead of 94 
original amino acids. The alternative 
region is in the PROTEIN KINASE 
domain. The new variant maintains 
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the two PHORBOL-ESTER AND 
DAG BINDING domains, the two 
ATP binding sites and the ACTIVE of 
the KINASE domain. 



The following is a list of the original tumor-involved sequences, followed 
by all the splice variants obtained therefrom with a list of differences between the 
5 original TIG sequence and the variant. 

KU(p70/p80) 
KU70_HUMAN 

10 

FUNCTION : SINGLE STRANDED DNA-DEPENDENT ATP-DEPENDENT 
HELICASE. HAS A ROLE IN CHROMOSOME TRANSLOCATION. THE 
DNA HELICASE II COMPLEX BINDS PREFERENTIALLY TO FORK-LIKE 
ENDS OF DOUBLE-STRANDED DNA IN A CELL CYCLE-DEPENDENT 
15 MANNER. IT WORKS IN THE 3'-5' DIRECTION. BINDING TO DNA MAY 
BE MEDIATED BY P70. 

SUBUNIT : HETERODIMER OF A 70 KD AND A 80 KD SUBUNIT. 
SUBCELLULAR LOCATION : NUCLEAR. 

PTM : PHOSPHORYLATED IN VIVO AT SERINE RESIDUES (BY 
20 SIMILARITY). 

DISEASE : INDIVIDUALS WITH SLE AND RELATED DISORDERS 
PRODUCE EXTREMELY LARGE AMOUNTS OF AUTOANTIBODIES TO 
P70 AND P86. EXISTENCE OF A MAJOR AUTOANTIGENIC EPITOPE OR 
EPITOPES ON THE CARBOXY TERMINAL 190 AMINO ACIDS OF P70 
25 CONTAINING THE LEUCINE REPEAT. THE MAJORITY OF 
AUTOANTIBODIES TO P70 IN MOST SERA FROM PATIENTS WITH SLE 
SEEM TO BE REACTIVE WITH THIS REGION. 

SIMILARITY : BELONGS TO THE ATP-DEPENDENT DNA HELICASE II 70 
KD SUBUNIT FAMILY. 

30 

NV 1 



The new variant has an alternative 3' exon of 5 amino acids instead of 240 
35 amino acids. It is probably missing the PHOSPHORYLATION (BY NUCLEAR 
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KINASE Nil) site and half of the PRO-RICH domain but retains the 
LEUCINE-ZIPPER domain. 

KU (p70/p80) 

5 

KU70_HUMAN 

NV_2 

lb The new variant has a deletion of 210 amino acids between residues 304 - 

515. The new variant lacks the PRO-RICH domain but retains the 
LEUCINE-ZIPPER domain and PHOSPHORYLATION (BY NUCLEAR 
KINASE Nil) site. 

15 LCK 

LCK_HUMAN 

PROTO-ONCOGENE TYROSINE-PROTEIN KINASE LCK 
20 FUNCTION : MAY PARTICIPATE IN ANTIGEN-INDUCED T-CELL 
ACTIVATION. 

CATALYTIC ACTIVITY : ATP + A PROTEIN TYROSINE = ADP + 
PROTEIN TYROSINE PHOSPHATE. 

ENZYME REGULATION : REGULATED BY PHOSPHORYLATION ON 
25 TYR-504. 

SUBCELLULAR LOCATION : BOUND TO THE CYTOPLASMIC DOMAIN 
OF EITHER CD4 OR CD 8. 
SIMILARITY : CONTAINS 1 SH2 DOMAIN. 
SIMILARITY : CONTAINS 1 SH3 DOMAIN. 
30 SIMILARITY : TO OTHER PROTEIN-TYROSINE KINASES IN THE 
CATALYTIC DOMAIN. BELONGS TO THE SRC SUBFAMILY. 

NY 3 



35 



The new variant has an alternative 3' exon of 45 amino acids instead of 
163 amino acids. The new variant retains both SH domains and most of the 
PROTEIN KINASE domain including two ATP BINDING sites and the 
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ACTIVE SITE. It is missing the 3' end of the PROTEIN KINASE domain and 
lacks the AUTO-PHOSPHORYLATION and PHOSPHORYLATION sites. 

LCK 

5 

LCK_HUMAN 

NV_4 

10 Insertion of 58 amino acids after amino acid 62 (insertion does not result 

in truncation). Insertion in first SH2 domain. The new variant retains all 
important sites including: the PROTEIN KINASE DOMAIN with two ATP 
BINDING sites, an ACTIVE SITE and an AUTO PHOSPHORYLATION site. 
An additional PHOSPHORYLATION site. 

15 

OSTEOPONTm 
OSTP_HUMAN 

20 FUNCTION : BINDS TIGHTLY TO HYDROXYAPATITE. APPEARS TO 
FORM AN INTEGRAL PART OF THE MINERALIZED MATRIX. 
PROBABLY IMPORTANT TO CELL-MATRIX INTERACTION. 
ALTERNATIVE PRODUCTS : TWO ISOFORMS; OPIA AND OPIB 
(SHOWN HERE); ARE PRODUCES BY ALTERNATIVE SPLICING. 

25 PTM : EXTENSIVELY PHOSPHORYLATED ON SERINE RESIDUES. 
PTM : N- AND O-GLYCOSYLATED. 

DISEASE : THIS PROTEIN PLAYS A PRINCIPAL ROLE IN URINARY 
STONE FORMATION AS THE STONE MATPOX 

30 NV_5 

The new variant has an alternative 3' exon of 12 amino acids instead of 
134 amino acids. The new variant maintains the CELL ATTACHMENT SITE 
and two GLYCOSILATION sites. 

35 
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GADD45 
GA45_HUMAN 

5 GROWTH ARREST AND DNA-DAMAGE-INDUCIBLE PROTEIN 

FUNCTION : INVOLVED IN THE REGULATION OF GROWTH AND 
APOPTOSIS. MEDIATES ACTIVATION OF STRESS-RESPONSIVE 
MTK1/MEKK4 MAPKKK. 

SIMILARITY: BELONGS TO THE GADD45 / MYD 1 1 8 FAMILY. 



NV_6 

The new variant has an alternative 5' exon of 72 amino acids instead of 
15 125 amino acids. The new veiriant has a signal peptide and has the two 
PHOSPHORYLATION (BY CK2) sites. 

WNT-11 PROTEIN 
20 WN11_HUMAN 

FUNCTION : PROBABLE DEVELOPMENTAL PROTEIN. MAY BE A 
SIGNALING MOLECULE WHICH AFFECT THE DEVELOPMENT OF 
DISCRETE REGIONS OF TISSUES. IS LIKELY TO SIGNAL OVER ONLY 
25 FEW CELL 
DIAMETERS. 

SUBCELLULAR LOCATION : POSSIBLY SECRETED AND ASSOCIATES 
WITH THE EXTRACELLULAR MATRIX. 
SIMILARITY : BELONGS TO THE WNT FAMILY 

30 

NV_7 

The new variant has a deletion of 22 amino acids after residue 312 
35 (between 312-334). The new variant has all five potential GLYCOSILATION 
sites. 
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WNT-11 PROTEIN 
WN11_HUMAN 

5 

NV_8 

The new varizint has a deletion of 117 amino acids after residue 116 
(between 1 16-233). The new variant is missing one potential GLYCOSILATION 
10 site (out of 5 sites). 

PROTEIN KINASE C, THETA TYPE 
KPCT_HUMAN 

15 FUNCTION : TfflS IS CALCIUM-INDEPENDENT, PHOSPHOLIPID- 
DEPENDENT, SERINE- AND THREONINE- SPECIFIC ENZYME. 
FUNCTION : PKC IS ACTIVATED BY DIACYLGLYCEROL WHICH IN 
TURN PHOSPHORYLATES A RANGE OF CELLULAR PROTEINS. PKC 
ALSO SERVES AS THE RECEPTOR FOR PHORBOL ESTERS, A CLASS OF 

20 TUMOR 

PROMOTERS. 

TISSUE SPECIFICITY : SKELETAL MUSCLE, MEGAKARYOBLASTIC 
CELLS AND PLATELETS. 

SIMILARITY : CONTAINS 2 ZINC-DEPENDENT PHORBOL-ESTER AND 
25 DAG BINDING DOMAINS. 

SIMILARITY : BELONGS TO THE SERn^HR FAMILY OF PROTEIN 
KINASES. PKC SUBFAMILY. 

NV_9 

30 The new variant has an alternative 3' exon of 3 amino acids instead of 94 

amino acids. The alternative region is in the PROTEIN KINASE domain. The 
new variant maintains the two PHORBOL-ESTER AND DAG BINDING 
domains, the two ATP binding sites and the ACTIVE of the KINASE domain. 



I. 
t 
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INTERFERON REGULATORY FACTOR 1 
IRF1_HUMAN 

FUNCTION : SPECIFICALLY BINDS TO THE UPSTREAM REGULATORY 
5 REGION OF TYPE I IFN AND IFN-INDUCIBLE MHC CLASS I GENES 
(THE INTERFERON CONSENSUS SEQUENCE (ICS)) AND ACTIVATES 
THOSE GENES. 

SUBCELLULAR LOCATION : NUCLEAR. 
INDUCTION : BY VIRUSES AND IFN. 
10 DISEASE : DELETION OR REARRANGEMENT OF IRFl ARE A CAUSE OF 
PRELEUKEMIC MYELODYSPLASTIC SYNDROME (MDS) AND OF 
ACUTE MYELOGENOUS LEUKEMIA (AML). 
SIMILARITY : BELONGS TO THE IRF FAMILY. 

15 NV_10 

The new variant has an alternative 3' exon of 7 amino acids instead of 40 
amino acids. The new variant maintains the DNA binding domain. 

BASIC FIBROBLAST GROWTH FACTOR RECEPTOR 1 
20 FGR1_HUMAN 

FUNCTION : RECEPTOR FOR BASIC FIBROBLAST GROWTH FACTOR. A 
SHORTER FORM OF THE RECEPTOR COULD BE A RECEPTOR FOR 
ACIDIC FGF (AFGF). 
25 CATALYTIC ACTIVITY : ATP + PROTEIN TYROSINE = ADP + PROTEIN 
TYROSINE PHOSPFIATE. 

SUBCELLULAR LOCATION : TYPE I MEMBRANE PROTEIN. 
ALTERNATIVE PRODUCTS : MANY FORMS OF FGFRl ARE PRODUCED 
BY ALTERNATIVE SPLICING. THE FORM SHOWN HERE IS KNOWN AS 
30 ALPHA-Al. 

DISEASE : DEFECTS IN FGFRl ARE ONE OF THE CAUSES OF PFEIFFER 
SYNDROME, ALSO CALLED ACROCEPHALOSYNDACTYLY TYPE V 
(ACS V), CHARACTERIZED BY CRANIOSYNOSTOSIS (PREMATURE 
FUSION OF THE SKULL SUTURES) WITH DEVIATION AND 
35 ENLARGEMENT OF THE THUMBS AND GREAT 
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TOES,BRACHYMESOPHALANGY, WITH PHALANGEAL ANKYLOSIS 
AND A VARYING DEGREE OF SOFT TISSUE SYNDACTYLY. 
SIMILARITY : BELONGS TO THE FIBROBLAST GROWTH FACTOR 
RECEPTOR FAMILY. 
5 SIMILARITY : CONTAINS 3 IMMUNO GLOBULIN-LIKE DOMAINS. 

NV_11 

The new variant has an alternative 3' exon of 14 amino acids instead of 
134 amino acids. The new variant has the entire extracellular domain and the TM, 
10 it is missing part of the cytoplasmic domain. The new variant maintains all 3 
IMMUNOGLOBULIN-LIKE DOMAINS, the protein KINASE domain, the 
ACTIVE site, and the 2 ATP binding sites, but it might be missing one of the two 
PHOSPHORYLATION (AUTO-) sites. 

1 5 REF-1 PROTEIN DNA-(APURINIC OR APYRIMIDINIC SITE) LYASE 

APE1_HUMAN 

FUNCTION : REPAIRS OXIDATIVE DNA DAMAGES IN VITRO. MAY 
HAVE A ROLE IN PROTECTION AGAINST CELL LETHALITY AND 
20 SUPPRESSION OF MUTATIONS. REMOVES THE BLOCKING GROUPS 
FROM THE 3' 

TERMINI OF THE DNA STRAND BREAKS GENERATED BY IONIZING 
RADIATIONS AND BLEOMYCIN. 

CATALYTIC ACTIVITY : ENDONUCLEOLYTIC CLEAVAGE NEAR 
25 APURINIC OR APYRIMIDINIC SITES TO PRODUCTS WITH 
5*-PHOSPHATE. 

SUBCELLULAR LOCATION : NUCLEAR. 

SIMILARITY : BELONGS TO THE AP/EXOA FAMILY OF DNA REPAIR 
ENZYMES. 



30 
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NV_12 

The new variant has a gap of 22 amino acids between residues 146 - 169. 
The new variant maintains the ACTIVE site and site important for substrate 
recognition. 

5 

NV_13 

The new variant has an insertion of 25 amino acids after residue 18. It 
maintains the ACTIVE site and the site important for substrate recognition. 

1 0 MAD3 MAJOR HISTOCOMPATIBILITY COMPLEX 

ENHANCER-BINDING PROTEIN 

MAD3_HUMAN 

15 FUNCTION : I-KAPPA-B-LIKE ACTIVITY. MAY BE INVOLVED IN 
REGULATION OF TRANSCRIPTIONAL RESPONSES TO NF-KAPPA-B, 
INCLUDING ADHESION- DEPENDENT PATHWAYS OF MONOCYTE 
ACTIVATION. INTERACTS DIRECTLY WITH THE NF-KAPPA-B 
COMPLEX, PRESUMABLY THROUGH THE P65 SUBUNIT. 

20 INDUCTION : INDUCED IN ADHERENT MONOCYTES . 

PTM : PHOSPHORYLATION OF I-KAPPA-B BLOCKS ITS ABILITY TO 
INHIBIT NF-KAPPA-B DNA-BINDING ACTIVITY. 
SIMILARITY : CONTAINS 5 ANK REPEATS. 

25 NV_14 

The new variant has an alternative 3' exon of 3 amino acids instead of 15 
amino acids. It retains all five ANK motifs and the two PHOSPHORYLATION 
sites. 

30 

NV_15 

The new variant has a deletion of 28 amino acids between 183 - 212. The 
deletion is in the ANK MOTIF 4.The new variant maintains 4 out of the five 
35 ANK MOTIFS and the two PHORYLATION sites. 
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RECEPTOR PROTEIN-TYROSiNE KINASE HEK8 
EPA4_HUMAN 

^ FUNCTION : RECEPTOR FOR MEMBERS OF THE EPHRIN-A FAMILY. 
BINDS TO EPHRIN-A 1, -A4 AND -A5. BINDS MORE POORLY TO 
EPHRIN-A2 AND A-3. 

CATALYTIC ACTIVITY : ATP + A PROTEIN TYROSINE = ADP + 
10 PROTEIN TYROSINE PHOSPHATE. 

SUBCELLULAR LOCATION : TYPE I MEMBRANE PROTEIN. 
SIMILARITY : CONTAINS 2 FIBRONECTIN TYPE III-LIKE DOMAINS. 
SIMILARITY : TO OTHER PROTEIN-TYROSINE KINASES IN THE 
CATALYTIC DOMAIN. BELONGS TO THE EPHRIN RECEPTOR FAMILY. 

15 

NV_16 

Deletion of 65 amino acids between 832 - 898. The deletion in the 
cytoplasmic domain. The 3' end of the PROTEIN KINASE domain is missing, 
but all important sites are maintained. The new variant has two FYBRONECTIN 
20 TYPE III domains and the protein KINASE domain with 2 ATP binding sites, an 
ACTIVE site and an auto PHOSPHORYLATION site. 

C-ETS-i2 PROTEIN 
ETS2_HUMAN 

25 

SUBCELLULAR LOCATION : NUCLEAR. 
SIMILARITY: BELONGS TO THE ETS FAMILY. 



NV_17 

The new variant has a deletion of 26 amino acids between 87 - 114. The 
new variant maintains the DNA binding domain. 
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WNT-5A PROTEIN 
WN5A_HUMAN 

FUNCTION : PROBABLE DEVELOPMENTAL PROTEIN. MAY BE A 
5 SIGNALING MOLECULE WHICH AFFECTS THE DEVELOPMENT OF 
DISCRETE REGIONS OF TISSUES. IS LIKELY TO SIGNAL OVER 
ONLY FEW CELL DIAMETERS. 

SUBCELLULAR LOCATION : POSSIBLY SECRETED AND ASSOCIATES 
WITH THE EXTRACELLULAR MATRIX. 
10 SIMILARITY : BELONGS TO THE WNT FAMILY 

NV_18 

The new variant has an alternative 3' exon of 4 amino acids instead of 
109. It is identical to the known protein until residue 256. Two 
15 GLYCOSILATION sites out of four are missing in the new variant. 

TYROSINE-FROTEIN KINASE SKY 
TY03_HUMAN 

20 FUNCTION : MAY BE INVOLVED IN CELL ADHESION PROCESSES, 

PARTICULARLY IN THE CENTRAL NERVOUS SYSTEM. 

SUBCELLULAR LOCATION : TYPE I MEMBRANE PROTEIN. 

TISSUE SPECIFICITY : ABUNDANT IN THE BRAIN AND LOWER LEVELS 

IN OTHER TISSUES. 
25 SIMILARITY : TO OTHER PROTEIN-TYROSINE KINASES IN THE 

CATALYTIC DOMAIN. 

SIMILARITY : CONTAINS 2 IMMUNOGLOBULIN-LIKE C2-TYPE 
DOMAINS. 

SIMILARITY : CONTAINS 2 FIBRONECTIN TYPE III-LIKE DOMAINS. 

30 
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NV_19 

The new variant has an alternative 3 ' exon of 45 amino acids instead of 
216 amino acids. The new variant is missing part of the PROTEIN KINASE 
domain and its AUTOPHOSPHORYLATION site. However, it maintains all 
5 other necessary domains: the ACTIVE site and the two ATP binding sites. The 
variant retains all 6 GLYCOSILATION sites, the 2 IG-like domains and the 2 
FIBRONEXTIN TYPE III domains. 

NEURAL-CADHERIN 
10 CAD2_HUMAN 

FUNCTION : CADHERINS ARE CALCIUM DEPENDENT CELL ADHESION 
PROTEINS. THEY PREFERENTIALLY INTERACT WITH THEMSELVES 
IN A HOMOPHILIC MANNER IN CONNECTING CELLS; CADHERINS 
15 MAY THUS CONTRIBUTE TO THE SORTING OF HETEROGENEOUS 
CELL TYPES. N-CADHERIN MAY BE INVOLVED IN NEURONAL 
RECOGNITION MECHANISM. 

SUBCELLULAR LOCATION : TYPE I MEMBRANE PROTEIN. 
SIMILARITY : BELONGS TO THE CADHERIN FAMILY. 

20 

NV_20 

The new variant has an alternative 3' exon of 10 amino acids instead of 68 
amino acids. The new variant maintains the extracellular domain and the TM 
domain. It is missing the end of the cytoplasmic domain and the SER-RICH 
25 domain. However, it has all other necessary domains including: 5 CADHERIN 
REPEATS with 7 GLYCOSILATION sites. 

MXIl MAX INTERACTING PROTEIN 1 
MXI1_HUMAN 

30 

FUNCTION : TRANSCRIPTIONAL REPRESSOR. MXIl BINDS WITH MAX 
TO FORM A SEQUENCE-SPECIFIC DNA-BINDING PROTEIN COMPLEX 
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WHICH RECOGNIZES THE CORE SEQUENCE 5'-CAC[GA]TG-3'. MXII 
THUS ANTAGONIZES MYC TRANSCRIPTIONAL ACTIVITY BY 
COMPETING FOR MAX. 

SUBUNIT : EFFICIENT DNA BINDING REQUIRES DIMERIZATION WITH 
5 ANOTHER BHLH PROTEIN.BINDS DNA AS A HETERODIMER WITH 
MAX. 

ST mCRTLULAR LOCATION : NUCLEAR. 

TISSUE SPECIFICITY : HIGH LEVELS FOUND IN THE BRAIN, HEART 
AND LUNG WHILE LOWER LEVELS ARE SEEN IN THE LIVER, KIDNEY 
10 AND SKELETAL MUSCLE. 

DISEASE : DEFECTS IN MXII ARE FOUND IN SOME PATIENTS WITH 
PROSTATE TUMORS. 

SIMILARITY : BELONGS TO THE BASIC HELIX-LOOP-HELIX (BHLH) 
FAMILY OF TRANSCRIPTION FACTORS. 

15 

NV_21 

The new variant has an insertion of 24 amino acids after residue 79. It is 
most likely truncated within the insertion. The new variant retains the BASIC 
DNA BINDING domain, but lacks the HELIX LOOP HELIX motif. 

20 

MXII MAX INTERACTING PROTEIN 1 
MXI1_HUMAN 

NV_22 

25 The new variant has an alternative 5' exon of 31 amino acids instead of 

25. It is identical to the known protein from residue 26 to the end. The new 
variant has both the DNA BINDING DOMAIN and the HELIX LOOP HELIX 
motif. The alternative 5' exon bares a clathrin repeat. 



I 
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DUAL SPECIFICITY MITOGEN-ACTIVATED PROTEIN KINASE 

KINASE 3 
MPK3_HUMAN 

5 FUNCTION : DUAL SPECIFICITY KINASE. IS ACTIVATED BY 
CYTOKINES AND ENVIRONMENTAL STRESS IN VIVO. CATALYZES 
THE CONCOMITANT PHOSPHORYLATION OF A THREONINE AND A 
TYROSINE RESIDUE IN THE MAP KINASE P38. 

ENZYME REGULATION : ACTIVATED BY DUAL PHOSPHORYLATION 
10 ONSER-189 ANDTHR-193. 

TISSUE SPECIFICITY : ABUNDANT EXPRESSION IS SEEN IN THE 
SKELETAL MUSCLE. IT IS ALSO WIDELY EXPRESSED IN OTHER 
TISSUES. 

PTM : AUTOPHOSPHORYLATED. 
15 SIMILARITY : BELONGS TO THE SER/THR FAMILY OF PROTEIN 
KINASES. MAP KINASE KINASE SUBFAMILY. 

NV_23 

The new variant has an alternative 3' exon of 10 amino acids instead of 28 
20 amino acids. The new variant maintains the PROTEIN KINASE domain with its 
two ATP binding sites, the ACTIVE site and two POSPHORYLATION sites.It 
may lack a few amino acids at the end of the PROTEIN KINASE domain. 

DNA-REPAIR PROTEIN XRCCl 
25 XRC1_HUMAN 

FUNCTION : CORRECTS DEFECTIVE DNA STRAND-BREAK REPAIR 
AND SISTER CHROMATID EXCHANGE FOLLOWING TREATMENT 
WITH IONIZING RADIATION AND ALKYLATING AGENTS. 
30 SUBCELLULAR LOCATION : NUCLEAR (PROBABLE). 
SIMILARITY : SOME, TO S.POMBE RAD4/CUT5. 
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NV_24 

Alternative 3' exon of 50 amino acids instead of 391 amino acids. 

DNA-REPAIR PROTEIN XRCCl 
5 XRC1_HUMAN 

NV_25 

Alternative 3' exon of 25 amino acids instead of 392 amino acids. 

1 0 DNA-REPAIR PROTEIN XRCCl 

XRC1_HUMAN 

NV_26 

Alternative 3' exon of 61 amino acids instead of 447 amino acids. 

15 

DNA-REPAIR PROTEIN XRCCl 
XRC1_HUMAN 

NV_27 

20 Alternative 3' exon of 84 amino acids instead of 93 amino acids. 

MERLIN SCHWANNOMIN (NF2) 
MERL_HUMAN 

25 FUNCTION : PROBABLY ACTS AS A MEMBRANE STABILIZING 
PROTEIN. 

TISSUE SPECIFICITY: IN FETAL BRAIN; IN KIDNEY, LUNG, BREAST, 
AND OVARY. 

DISEASE : NEUROFIBROMATOSIS 2 (NF2) OR CENTRAL 
30 NEUROFIBROMATOSIS IS A GENETIC DISORDER CHARACTERIZED 
BY BILATERAL VESTIBULAR SCHWANNOMAS (FORMERLY CALLED 



ACOUSTIC NEUROMAS), SCHWANNOMAS OF OTHER CRANIAL AND 
PERIPHERAL NERVES,MENINGIOMAS, AND EPENDYMOMAS. IT IS 
INHERITED IN AN AUTOSOMAL DOMINANT FASHION WITH FULL 
PENETRANCE. AFFECTED INDIVIDUALS GENERALLY DEVELOP 
SYMPTOMS 

OF EIGHTH-NERVE DYSFUNCTION IN EARLY ADULTHOOD, 
INCLUDING DEAFNESS AND BALANCE DISORDER. ALTHOUGH THE 
TUMORS OF NF2 ARE HISTOLOGICALLY BENIGN, THEIR ANATOMIC 
LOCATION MAKES MANAGEMENT DIFFICULT, AND PATIENTS 
SUFFER GREAT MORBIDITY AND MORTALITY. 

SIMILARITY : CONTAINS A DOMAIN FOUND IN BAND 4.1, EZRIN, 
MOESIN, RADIXIN, AND TALIN. 

NV_28 

The new vjiriant has a deletion of 29 amino acids after residue 33 3. The 
new variant maintains the BAND 4,1 - LIKE domain. (Band 4.1, which links the 
spectrin-actin cytoskeleton of erythrocytes to the plasma membrane). 

DPI POLYPOSIS LOCUS PROTEIN 1 
DP1_HUMAN 

SUBCELLULAR LOCATION : INTEGRAL MEMBRANE PROTEIN 
(POTENTIAL). 

SIMILARITY : TO C.ELEGANS T19C3.4. 
NV_29 

Alternative 3' exon of 21 amino acids instead of 72 amino acids. The new 
variant maintains the two transmembrane domains. 



MDRl MULTIDRUG RESISTANCE PROTEIN 1 
MDR1_HUMAN 

FUNCTION : ENERGY-DEPENDENT EFFLUX PUMP RESPONSIBLE FOR 
DECREASED DRUG ACCUMULATION IN MULTIDRUG-RESISTANT 
CELLS. 

SUBCELLULAR LOCATION : INTEGRAL MEMBRANE PROTEIN. 
SIMILARITY : BELONGS TO THE ATP-BINDING TRANSPORT PROTEIN 
FAMILY (ABC TRANSPORTERS). MDR SUBFAMILY. 

NV_30 

The new variant has an alternative 3' exon of 1 amino acid, instead of 3 
of the known protein. The new variant is identical to the known protein until 
residue 1277. It maintains all important sites. 

MDRl MULTIDRUG RESISTANCE PROTEIN 1 
MDR1_HUMAN 

NV_31 

The new variant is a truncated protein. It has an alternative 3' exon of 12 
amino acids instead of 713. It is identical to the known protein until residue 567. 
The new variant retains only one out of two ATP binding sites, and six out of 
twelve TM domains. It has one out of three cytoplasmic domains and is truncated 
in the middle of the second cytoplasmic domain. 

JNKl - MITOGEN ACTIVATED PROTEIN KINASE 8 

MK08_HUMAN 

FUNCTION : PROBABLY PLAYS A ROLE IN THE SIGNAL 
TRANSDUCTION PATHWAY INITIATED BY PROINFLAMMATORY 
CYTOKINES AND UV RADIATION. BINDS TO THE N-TERMINAL 
ACTIVATION DOMAINS OF C-JUN AND ATF2 AND PHOSPHORYLATES 
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THEIR REGULATORY SITES (RESPECTIVELY SER-63 AND SER-73; 

THR-69 AND THR-71). JNKl ISOFORMS DISPLAY DIFFERENT BINDING 

PATTERNS: BETA-1 PREFERENTIALLY BINDS TO C-JUN, WHEREAS 

ALPHA- 1, ALPHA-2, AND BETA-2 HAVE A SIMILAR LOW LEVEL OF 
5 BINDING TO BOTH C-JUN. OR ATF2. HOWEVER THERE IS NO 

CORRELATION BETWEEN BINDING AND PHOSPHORYLATION, WHICH 

IS ACHIEVED ABOUT AT THE SAME EFFICIENCY BY ALL ISOFORMS. 

ENZYME REGULATION : ACTIVATED BY THREONINE AND TYROSINE 

PHOSPHORYLATION. 
10 ALTERNATIVE PRODUCTS : FOUR ISOFORMS JNKl ALPHA- 1, JNKl 

ALPHA-2 (SHOWN HERE), JNKl BETA-1, AND JNKl BETA-2 

ARE PRODUCED BY ALTERNATIVE SPLICING. 

INDUCTION : BY UV LIGHT, INTERLEUKIN- 1 AND BY HA-RAS. 

SIMILARITY : BELONGS TO THE CDC2/CDC28 SUBFAMILY OF SER/THR 
15 PROTEIN KINASES. STRONGEST SIMILARITY WITH OTHER MAP 

KINASES. 

NV_32 

The new variant is a truncated protein. It has an alternative 3' exon of 13 
20 amino acids instead of 222 amino acids. It is identical to the known protein until 
residue 205. The new variant lacks part of the PROTEIN KINASE domain, 
however it retains the ACTIVE SITE, the two ATP binding sites and the two 
PHOSPHORYLATION sites. 

25 JNKl - MITOGEN ACTIVATED PROTEIN KINASE 8 

MK08_HUMAN 

NV_33 

The new variant has an alternative 3' exon of 14 amino acids instead of 
30 134 amino acids of the known protein. It is identical to the known protein until 
residue 293. The new variant lacks the end of the PROTEIN KINASE domain, 
but retains the ACTIVE SITE, the two ATP binding sites and the two 
PHOSPHORYLATION sites. 
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JNKl - MITOGEN ACTIVATED PROTEIN KINASE 8 

MK08_HUMAN 

5 

NV_34 

The new variant has an alternative 3' axon of 7 amino acids a instead of 
95 amino acids. It is identical to the known protein until residue 332. Has the 
entire PROTEIN KINASE domain including the ACTIVE SITE, the two ATP 
1 0 binding sites and the two PHOSPHORYLATION sites. 

MITOGEN-ACTIVATED PROTEIN KINASE 12 

Synonym(s) EC 2.7.1- extracellular signal-regulated kinase 6 EC 2.7.1- 
15 ERK 6 ERK 5 stress-activated protein kinase-3 mitogen-activated protein kinase 
P38 gamma map kinase P38 gamma 

Gene name(s) MAPK12 or ERK 6 or SAPK 3 

FUNCTION : PHOSPHORYLATES MYELIN BASIC PROTEIN (MBP); ACTS 
20 AS SIGNAL TRANSDUCER DURING THE DIFFERENTL^TION OF 

MYOBLASTS TO MYOTUBES. OVEREXPRESSION ENHANCES THIS 

DIFFERENTIATION EVENT, WHEREAS INACTIVATION EXHIBITS IT 

AND MAINTAINS THE CELLS IN A PROLIFERATIVE STATE. 

ENZYME REGULATION : ACTIVATED BY PHSOPHORYLATION ON 
25 THREONINE AND TYROSINE (BY SIMILARITY). 

TISSUE SPECIFICITY : HIGHLY EXPRESSED IN SKELETAL MUSCLE. 

SIMILARITY : BELONGS TO THE SERHUR FAMILY OF PROTEIN 

KINASES . MAP KINASE FAMILY. 
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NV_35 

The new variant contains 152 N-terminal amino acids of the original 
protein. The new variant has alternative 25 amino acids in its C-terminus, instead 
of original 215 amino acids. It contains the NP BIND (between the amino acids 
5 33 — 41, and the ATP binding site at position 56. The truncated variant has only 
part of the kinase domain, it lacks the active site and both the phosphorylation 
sites that activates the kinase. This truncated splice variant can act as dominant 
negative. 

10 PROTEIN KINASE C, THETA TYPE 

KPCT_HUMAN 

FUNCTION : THIS IS CALCIUM-INDEPENDENT, PHOSPHOLIPID- 
DEPENDENT, SERINE- AND THREONINE-SPECIFIC ENZYME. 
15 FUNCTION : PKC IS ACTIVATED BY DIACYLGLYCEROL WHICH IN 
TURN PHOSPHORYLATES A RANGE OF CELLULAR PROTEINS! PKC 
ALSO SERVES AS THE RECEPTOR FOR PHORBOL ESTERS, A CLASS OF 
TUMOR PROMOTERS. 

TISSUE SPECIFICITY : SKELETAL MUSCLE, MEGAKARYOBLASTIC 
20 CELLS AND PLATELETS. 

SIMILARITY : CONTAINS 2 ZINC -DEPENDENT PHORBOL-ESTER 
AND DAG BINDING DOMAINS. 

SIMILARITY : BELONGS TO THE SER/THR FAMILY OF PROTEIN 
KINASES. PCK SUBFAMILY. 

25 

NV_36 

The new variant has an alternative 3' exon of 36 amino acids instead of 94 
original amino acids. The alternative region is in the PROTEIN KINASE domain. 
The new variant maintains the two PHORBOL-ESTER AND DAG BINDING 
30 domains, the two ATP binding sites and the ACTIVE of the KINASE domain. 
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Example II: Variant nucleic acid sequence 

The nucleic acid sequences of the invention include nucleic acid 
sequences which encode variant product and fragments and analogs thereof. The 
5 nucleic acid sequences may alternatively be sequences complementary to the 
above coding sequence, or to a region of said coding sequence. The length of the 
complementary sequence is sufficient to avoid the expression of the coding 
sequence. The nucleic acid sequences may be in the form of RNA or in the form 
of DNA, and include messenger RNA, synthetic RNA and DNA, cDNA, and 

10 genomic DNA. The DNA may be double-stranded or single-stranded, and if 
single-stranded may be the coding strand or the non-coding (anti-sense, 
complementary) strand. The nucleic acid sequences may also both include 
dNTPs, rNTPs as well as non naturally occurring sequences. The sequence may 
also be a part of a hybrid between an amino acid sequence and a nucleic acid 

15 sequence. 

In a general embodiment, the nucleic acid sequence has at least 90%, 
identity with any one of the sequence identified as SEQ ID NO: 1 to SEQ ID 
NO: 36 provided that this sequence is not completely identical with that of the 
original sequence. 

20 The nucleic acid sequences may include the coding sequence by itself. By 

another alternative the coding region may be in combination with additional 
coding sequences, such as those coding for fusion protein or signal peptides, in 
combination with non-coding sequences, such as introns and control elements, 
promoter and terminator elements or 5' and/or 3' untranslated regions, effective 

25 for expression of the coding sequence in a suitable, host, and/or in a vector or host 
environment in which the variant nucleic acid sequence is introduced as a 
heterologous sequence. 

The nucleic acid sequences of the present invention may also have the 
product coding sequence fused in-frame to a marker sequence which allows for 

30 purification of the variant product. The marker sequence may be, for example, a 



-49- 

hexahistidine tag to provide for purification of the mature polypeptide fused to 
the marker in the case of a bacterial host, or, the marker sequence may be a 
hemagglutinin (HA) tag when a memimalian host, e.g. COS-7 cells, is used. The 
HA tag corresponds to an epitope derived from the influenza hemagglutinin 
5 protein (Wilson, L, et ai Cell 37:767 (1984)). 

Also included in the scope of the invention are fragments as defined above 
also referred to herein as oligonucleotides, typically having at least 20 bases, 
preferably 20-30 bases corresponding to a region of the coding-sequence nucleic 
acid sequence. The fragments may be used as probes, primers, and when 
10 complementary also as antisense agents, and the like, according to known 
methods. 

As indicated above, the nucleic acid sequence may be substantially a 
depicted in any one of SEQ ID NO: 1 to SEQ ID NO: 36 or fragments thereof or 
sequences having at least 90% identity to the above sequence as explained above. 
15 Alternatively, due to the degenerative nature of the genetic code, the sequence 
may be a sequence coding for any one of the amino acid sequence of SEQ ID 
NO: 37 to SEQ ID NO: 72, or fragments or analogs of said amino acid sequence. 

A. Preparation of nucleic acid sequences 

20 The nucleic acid sequences may be obtained by screening cDNA libraries 

using oligonucleotide probes which can hybridize to or PCR-amplify nucleic acid 
sequences which encode the variant products disclosed above. cDNA libraries 
prepared from a variety of tissues are commercially available and procedures for 
screening and isolating cDNA clones are well-known to those of skill in the art. 

25 Such techniques are described in, for example, Sambrook et al (1989) Molecular 
Cloning: A Laboratory Manual (2nd Edition), Cold Spring Harbor Press, 
Plainview, N.Y. and Ausubel FM et al. (1989) Current Protocols in Molecular 
Biology, John Wiley & Sons, New York, N.Y. 

The nucleic acid sequences may be extended to obtain upstream and 

30 downstream sequences such as promoters, regulatory elements, and 5' and 3' 
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untranslated regions (UTRs). Extension of the available transcript sequence may 
be performed by numerous methods known to those of skill in the art, such as 
PGR or pr4mer extension (Sambrook et al, supra), or by the RACE method 
using, for example, the Marathon RACE kit (Clontech, Cat. # K1802-1). 
5 Alternatively, the technique of '^restriction-site" PCR (Gobinda et al. PCR 

Methods Applic, 2:318-22, (1993)), which uses universal primers to retrieve 
flanking sequence adjacent a known locus, may be employed. First, genomic 
DNA is amplified in the presence of primer to a linker sequence and a primer 
specific to the known region. The amplified sequences are subjected to a second 

10 round of PCR with the same linker primer and another specific primer internal to 
the first one. Products of each round of PCR are transcribed with an appropriate 
RNA polymerase and sequenced using reverse transcriptase. 

Inverse PCR can be used to amplify or extend sequences using divergent 
primers based on a known region (Triglia, T. et al. Nucleic Acids Res. 16:8186, 

15 (1988)). The primers may be designed using OLIGO(R) 4.06 Primer Analysis 
Software (1992; National Biosciences Inc, Plymouth, Mirm.), or another 
appropriate program, to be 22-30 nucleotides in length, to have a GC content of 
50% or more, and to anneal to the target sequence at temperatures about 68-72°C. 
The method uses several restriction enzymes to generate a suitable fragment in 

2b the known region of a gene. The fi-agment is then circularized by intramolecular 
ligation and used as a PCR template. 

Capture PCR (Lagerstrom, M. et al, PCR Methods Applic, 1:111-19, 
(1991)) is a method for PCR ampUfication of DNA fragments adjacent to a 
known sequence in human and yeast artificial chromosome DNA. Capture PCR 

25 also requires multiple restriction enzyme digestions and ligations to place an 
engineered double-stranded sequence into a flanking part of the DNA molecule 
before PCR. 

Another method which may be used to retrieve flanking sequences is that 
of Parker, J.D., et al. Nucleic Acids Res., 19:3055-60, (1991)). Additionally, one 
30 can use PCR, nested primers and Promoter? inder™ libraries to "walk in" genomic 
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DNA (PromoterFinderTM; Clontech, Palo Alto, CA). This process avoids the need 
to screen libraries and is useful in finding intron/exon junctions. Preferred 
libraries for screening for full length cDNAs are ones that have been size-selected 
to include larger cDNAs. Also, random primed libraries are preferred in that they 

5 will contain more sequences which contain the 5* and upstream regions of genes. 

A remdomly primed library may be particularly useful if an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries are useful for 
extension into the 5' nontranslated regulatory region. 

The nucleic acid sequences and oligonucleotides of the invention can also 

10 be prepared by solid-phase methods, according to known synthetic methods. 
Typically, fragments of up to about 100 bases are individually synthesized, then 
joined to form continuous sequences up to several hundred bases. 

B. Use of variant nucleic acid sequence for the production of 
15 variant products 

In accordance with the present invention, nucleic acid sequences specified 
above may be used as recombinant DNA molecules that direct the expression of 
variant products. 

20 As will be understood by those of skill in the art, it may be advantageous 

to produce variant product-encoding nucleotide sequences possessing codons 
other than those which appear in any one of SEQ ID NO: 1 to SEQ ID NO: 36 
which are those which naturally occur in the human genome. Codons preferred 
by a particular prokaryotic or eukaryotic host (Murray, E. et al Nuc Acids Res., 

25 17:477-508, (1989)) can be selected, for example, to increase the rate of variant 
product expression or to produce recombinant RNA transcripts having desirable 
properties, such as a longer half-life, than transcripts produced from naturally 
occurring sequence. 

The nucleic acid sequences of the present invention can be engineered in 

30 order to alter a variant product coding sequence for a variety of reasons, 
including but not limited to, alterations which modify the cloning, processing 
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and/or expression of the product. For example, alterations may be introduced 
using techniques which are well known in the art, e.g., site-directed mutagenesis, 
to insert new restriction sites, to alter glycosylation pattems, to change-codon 
preference, etc. 

5 The present invention also includes recombinant constructs comprising 

one or more of the sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which a nucleic acid 
sequence of the invention has been inserted, in a forward or reverse orientation. 
In a preferred aspect of this embodiment, the construct further comprises 

10 regulatory sequences, including, for example, a promoter, operably linked to the 
sequence. Large numbers of suitable vectors and promoters are known to those 
of skill in the art, and are commercially available. Appropriate cloning and 
expression vectors for use with prokaryotic and eukaryotic hosts are also 
described in Sambrook, et al, (supra). 

15 The present invention also relates to host cells which are genetically 

engineered with vectors of the invention, and the production of the product of the 
invention by recombinant techniques. Host cells are genetically engineered (i.e., 
transduced, transformed or transfected) with the vectors of this invention which 
may be, for example, a cloning vector or an expression vector. The vector may 

20 be, for example, in the form of a plasmid, a viral particle, a phage, etc. The 
engineered host cells can be cultured in conventional nutrient media modified as 
appropriate for activating promoters, selecting transformants or amplifying the 
expression of the variant nucleic acid sequence. The culture conditions, such as 
temperature, pH and the like, are those previously used with the host cell selected 

25 for expression, and will be apparent to those skilled in the art. 

The nucleic acid sequences of the present invention may be included in 
any one of a variety of expression vectors for expressing a product. Such vectors 
include chromosomal, nonchromosomal and synthetic DNA sequences, e.g., 
derivatives of SV40; bacterial plasmids; phage DNA; baculovirus; yeast 

30 plasmids; vectors derived from combinations of plasmids and phage DNA, viral 
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DNA such as vaccinia, adenovirus, fowl pox virus, and pseudorabies. However, 
any other vector may be used as long as it is replicable and viable in the host. 
The appropriate DNA sequence may be inserted into the vector by a variety of 
procedures. In general, the DNA sequence is inserted into an appropriate 
5 restriction endonuclease site(s) by procedures known in the art. Such procedures 
and related sub-cloning procedures are deemed to be within the scope of those 
skilled in the art. 

The DNA sequence in the expression vector is operatively linked to an 
appropriate transcription control sequence (promoter) to direct mRNA synthesis. 

10 Examples of such promoters include: LTR or SV40 promoter, the E.coli lac or 
trp promoter, the phage lambda PL promoter, and other promoters known to 
control expression of genes in prokaryotic or eukaryotic cells or their viruses. 
The expression vector also contains a ribosome binding site for translation 
initiation, and a transcription terminator. The vector may also include 

15 appropriate sequences for amplifying expression. In addition, the expression 
vectors preferably contain one or more selectable marker genes to provide a 
phenotypic trait for selection of transformed host cells such as dihydrofolate 
reductase or neomycin resistance for eukaryotic cell culture, or such as 
tetracycline or ampicillin resistance in E,colu 

20 The vector containing the appropriate DNA sequence as described above, 

as well as an appropriate promoter or control sequence, may be employed to 
transform an appropriate host to permit the host to express the protein. Examples 
of appropriate expression hosts include: bacterial cells, such as E.colU 
Streptomyces, Salmonella typhimurium; fungal cells, such as yeast; insect cells 

25 such as Drosophila and Spodoptera Sf9; animal cells such as CHO, COS, HEK 
293 or Bowes melanoma; adenoviruses; plant cells, etc. The selection of an 
appropriate host is deemed to be within the scope of those skilled in the art from 
the teachings herein. The invention is not limited by the host cells employed. 

In bacterial systems, a number of expression vectors may be selected 

30 depending upon the use intended for the variant product. For example, when 
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large quantities of variant product are needed for the induction of antibodies, 
vectors which direct high level expression of fusion proteins that are readily 
purified may be desirable. Such vectors include, but are not limited to, 
multifunctional E.coli cloning and expression vectors such as Bluescript{K) 
5 (Stratagene), in which the variant polypeptide coding sequence may be ligated 
into the vector in-frame with sequences for the amino-terminal Met and the 
subsequent 7 residues of beta-galactosidase so that a hybrid protein is produced; 
pIN VQCtors (Van Heeke & Schuster J. BioL Chem. 264:5503-5509, (1989)); pET 
vectors (Novagen, Madison WI); and the like. 

10 In the yeast Saccharomyces cerevisiae a number of vectors containing 

constitutive or inducible promoters such as alpha factor, alcohol oxidase and 
PGH may be used. For reviews, see Ausubel et al. (supra) and Grant et al, 
{Methods in Enzymology 153:516-544, (1987)), 

In cases where plant expression vectors are used, the expression of a 

15 sequence encoding vciriant product may be driven by any of a number of 
promoters. For example, viral promoters such as the 35S and 19S promoters of 
CaMV (Brisson et al, Nature 310:511-514. (1984)) may be used alone or in 
combination with the omega leader sequence from TMV (Takamatsu et al, 
EMBO y., 6:307-311, (1987)), Alternatively, plant promoters such as the small 

20 subunit of RUBISCO (Coruzzi et al, EMBO J. 3:1671-1680, (1984); Broglie et 
ai. Science 224:838-843, (1984)); or heat shock promoters (Winter J and 
Sinibaldi R.M., Results Prohl Cell Differ., 17:85-105, (1991)) may be used. 
These constructs can be introduced into plant cells by direct DNA transformation 
or pathogen-mediated transfection. For reviews of such techniques, see Hobbs S. 

25 or Murry L.E. (1992) in McGraw Hill Yearbook of Science and Technology, 
McGraw Hill, New York, N.Y., pp 191-196; or Weissbach and Weissbach (1988) 
Methods for Plant Molecular Biology, Academic Press, New York, N.Y., pp 
421-463. 

Variant product may also be expressed in an insect system. In one such 
30 system, Autographa califomica nuclear polyhedrosis virus (AcNPV) is used as a 



vector to express foreign genes in Spodoptera friigiperda cells or in Trichoplnsia 
larvae. The variant product coding sequence may be cloned into a nonessential 
region of the virus, such as the polyhedrin gene, and placed under control of the 
polyhedrin promoter. Successful insertion of variant coding sequence will render 
5 the polyhedrin gene inactive and produce recombinant virus lacking coat protein 
coat. The recombinant viruses are then used to infect iS. frugiperda cells or 
Trichophisia larvae in which variant protein is expressed (Smith et aL, J, ViroL 
46:584, (1983); Engelhard, E.K. etal, Proc, Nat. Acad, ScL 91:3224-7, (1994)). 
In mammalian host cells, a number of viral-based expression systems may 

10 be utilized. In cases where an adenovirus is used as an expression vector, a 
variemt product coding sequence may be ligated into an adenovirus 
tremscription/translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a nonessential El or E3 region of the viral genome 
will result in a viable virus capable of expressing variant protein in infected host 

15 cells (Logan and Shenk, Proc, Natl Acad, Sci, 81:3655-59, (1984). In addition, 
transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be 
used to increase expression in mammalian host cells. 

Specific initiation signals may also be required for efficient translation of 
a variant product coding sequence. These signals include the ATG initiation 

20 codon and adjacent sequences. In cases where variant product coding sequence, 
its initiation codon and upstream sequences are inserted into the appropriate 
expression vector, no additional translational control signals may be needed. 
However, in cases where only coding sequence, or a portion thereof, is inserted, 
exogenous tremscriptional control signals including the ATG initiation codon 

25 must be provided. Furthermore, the initiation codon must be in the correct 
reading frame to ensure transcription of the entire insert. Exogenous 
transcriptional elements and initiation codons can be of various origins, both 
natural and synthetic. The efficiency of expression may be enhanced by the 
inclusion of enhancers appropriate to the cell system in use (Scharf, D. et ai. 
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(1994) Results Probl, Cell Differ., 20:125-62, (1994); Bittner et al.. Methods in 
Enzymol 153:516-544, (1987)). 

In a further embodiment, the present invention relates to host cells 
containing the above-described constructs. The host cell can be a higher 
5 eukaryotic cell, such as a mammalian cell, or a lower eukaryotic cell, such as a 
yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell. 
Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-Dextran mediated transfection, or electroporation 
(Davis, L., Dibner, M., and Battey, 1. (1986) Basic Methods in Molecular 
10 Biology). Cell-free translation systems can also be employed to produce 
polypeptides using RNAs derived from the DNA constructs of the present 
invention. 

A host cell strain may be chosen for its ability to modulate the expression 
of the inserted sequences or to process the expressed protein in the desired 

15 fashion. Such modifications of the protein include, but are not limited to, 
acetylation, carboxylation, glycosylation, phosphorylation, lipidation and 
acylation. Post-translational processing which cleaves a ''pre-pro*' form of the 
protein may also be important for correct insertion, folding and/or function. 
Different host cells such as CHO, HeLa, MDCK, 293, WIS 8, etc. have specific 

20 cellular machinery and characteristic mechanisrris for such post-translational 
activities and may be chosen to. ensure the correct modification and processing of 
the introduced, foreign protein. 

For long-term, high-yield production of recombinant proteins, stable 
expression is preferred. For example, cell lines which stably express variant 

25 product may be transformed using expression vectors which contain viral origins 
of replication or endogenous expression elements and a selectable marker gene. 
Following the introduction of the vector, cells may be allowed to grow for 1-2 
days in an enriched media before they are switched to selective media. The 
purpose of the selectable marker is to confer resistance to selection, and its 

30 presence allows growth and recovery of cells which successfully express the 

I 
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introduced sequences. Resistant clumps of stably transformed cells can be 
proliferated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed ceil 
lines. These include, but are not limited to, the herpes simplex virus thymidine 

5 kinase (Wigler M., et ai. Cell 11:223-32, (1977)) and adenine 
phosphoribosyltransferase (Lowy I., et al. Cell 22:817-23, (1980)) genes which 
can be employed in tk- or aprt- cells, respectively. Also, antimetabolite, 
antibiotic or herbicide resistance can be used as the basis for selection; for 
example, dhfr v^hich confers resistance to methotrexate (Wigler M., et al, Proc. 

10 f^atl Acad. ScL 77:3567-70, (1980)); npt, which confers resistance to the 
aminoglycosides neomycin and G-418 (Colbere-Garapin, F. et ai, J. Mol. BioL, 
150:1-14, (1981)) and als or pat, which confer resistance to chlorsulfuron and 
phosphinotricin acetyltransferase, respectively (Murry, supra). Additional 
selectable genes have been described, for example, trpB, which allows cells to 

15 utilize indole in place of tryptophan, or hisD, which allows cells to utilize histinol 
in place of histidine (Hartman S.C. and R.C. Mulligan, Proc, Natl Acad, Sci. 
85:8047-51, (1988)). The use of visible markers has gained popularity with such 
markers as anthocyanins, beta-glucuronidase and its substrate, GUS, and 
luciferase and its substrates, luciferin and ATP, being widely used not only to 

20 identify trans formants, but also to quantify the amount of transient or stable 
protein expression attributable to a specific vector system (Rhodes, C.A. et al, 
Methods Mol Biol, 55:121-131,(1995)). 

Host cells transformed with a nucleotide sequence encoding variant 
product may be cultured under conditions suitable for the expression and 

25 recovery of the encoded protein from cell culture. The product produced by a 
recombinant cell may be secreted or contained intracellularly depending on the 
sequence and/or the vector used. As will be understood by those of skill in the 
art, expression vectors containing nucleic acid sequences encoding variant 
product can be designed with signal sequences which direct secretion of variant 

30 product through a prokaryotic or eukaryotic cell membrane. 
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The variant product may also be expressed as a recombinant protein with 
one or more additional polypeptide domains added to facilitate protein 
purification. Such purification facilitating domains include, but are not limited 
to, metal chelating peptides such as histidine-tryptophan modules that allow 
5 purification on immobilized metals, protein A domains that allow purification on 
immobilized immunoglobulin, and the domain utilized in the FLAGS 
extension/affinity purification system (Immunex Corp, Seattle, Wash.). The 
inclusion of a protease-cleavable polypeptide linker sequence between the 
purification domain and variant product is useful to facilitate purification. One 

10 such expression vector provides for expression of a fusion protein compromising 
a variant polypeptide fused to a polyhistidine region separated by an enterokinase 
cleavage site. The histidine residues facilitate purification on IMIAC 
(immobilized metal ion affinity chromatography, as described in Porath, et al. 
Protein Expression and Purification, 3:263-281, (1992)) while the enterokinase 

15 cleavage site provides a means for isolating variant polypeptide from the fusion 
protein. pGEX vectors (Promega, Madison, Wis.) may also be used to express 
foreign polypeptides as fusion proteins with glutathione S-transferase (GST). In 
general, such fusion proteins are soluble and can easily be purified from lysed 
cells by adsorption to ligand-agarose beads (e.g., glutathione-agarose in the case 

20 of GST-fusions) followed by elution.in the presence of free ligand. 

Following transformation of a suitable host strain and growth of the host 
strain to an appropriate cell density, the selected promoter is induced by 
appropriate means (e.g., temperature shift or chemical induction) and cells are 
cultured for an additional period. Cells are typically harvested by centrifugation, 

25 disrupted by physical or chemical means, and the resulting crude extract retained 
for further purification. Microbial cells employed in expression of proteins can 
be disrupted by any convenient method, including freeze-thaw cycling, 
sonication, mechanical disruption, or use of cell lysing agents, or other methods, 
which are well know to those skilled in the art. 
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The variant products can be recovered and purified from recombinant cell 
cultures by any of a number of methods well known in the art, including 
ammonium sulfate or ethanol precipitation, acid extraction, anion or cation 
exchange chromatography, phosphocellulose chromatography, hydrophobic 
5 interaction chromatography, affinity chromatography, hydroxy lapatite 
chromatography, and lectin chromatography. Protein refolding steps can be used, 
as necessary, in completing configuration of the mature protein. Finally, high 
performance liquid chromatography (HPLC) can be employed for final 
purification steps. 

10 

C. Diagnostic applications utilizing nucleic acid sequences 

The nucleic acid sequences of the present invention may be used for a 
variety of diagnostic purposes. The nucleic acid sequences may be used to detect 
emd quantitate expression of the variant in patient's cells, e.g. biopsied tissues, by 

15 detecting the presence of mRNA coding for variant product. Alternatively, the 
assay may be used to detect soluble variant in the serum or blood. This assay 
typically involves obtaining total mRNA firom the tissue or serum and contacting 
the mRNA with a nucleic acid probe. The probe is a nucleic acid molecule of at 
least 20 nucleotides, preferably 20-30 nucleotides, capable of specifically 

20 hybridizing with a sequence included within the sequence of a nucleic acid 
molecule encoding variant product under hybridizing conditions, detecting the 
presence of mRNA hybridized to the probe, and thereby detecting the expression 
of variant. This assay can be used to distinguish between absence, presence, and 
excess expression of variant product and to monitor levels of variant expression 

25 during therapeutic intervention. In addition, the assay may be used to compare the 
levels of the variant of the invention to the levels of the original sequence fi-om 
which it has been varied or to levels of other variants, which comparison may 
have some physiological meaning. 

The invention also contemplates the use of the nucleic acid sequences as a 

30 diagnostic for diseases resulting from inherited defective variant sequences, or 
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diseases in which the ratio of the amount of the original sequence from which the 
variant was varied to the novel veiriants of the invention is altered. These 
sequences can be detected by comparing the sequences of the defective (i.e., 
mutant) variant coding region with that of a normal coding region. Association 
5 of the sequence coding for mutant variant product with abnormal variant product 
activity may be verified. In addition, sequences encoding mutant variant products 
can be inserted into a suitable vector for expression in a functional assay system 
(e.g., colorimetric assay, complementation experiments in a variant protein 
deficient strain of HEK293 cells) as yet another means to verify or identify 

10 mutations. Once mutant genes have been identified, one can then screen 
populations of interest for carriers of the mutant gene. 

Individuals carrying mutations in the nucleic acid sequence of the present 
invention may be detected at the DNA level by a variety of techniques. Nucleic 
acids used for diagnosis may be obtained from a patient's cells, including but not 

15 limited to such as from blood, urine, saliva, placenta, tissue biopsy and autopsy 
material. Genomic DNA may be used directly for detection or may be amplified 
enzymatically by using PGR (Saiki, et aL, Nature 324:163-166, (1986)) prior to 
analysis. RNA or cDNA may also be used for the same purpose. As an example, 
PGR primers complementary to the nucleic acid of the present invention can be 

20 used to identify and analyze mutations in the gene of the present invention. 
Deletions and insertions can be detected by a change in size of the amplified 
product in comparison to the normal genotype. 

Point mutations can be identified by hybridizing amplified DNA to 
radiolabeled RNA of the invention or alternatively, radiolabeled antisense DNA 

25 sequences of the invention. Sequence changes at specific locations may also be 
revealed by nuclease protection assays, such RNase and SI protection or the 
chemical cleavage method (e.g. Gotton, et alProc, Natl Acad. Set USA, 
85:4397-4401, (1985)), or by differences in melting temperatures. "Molecular 
beacons'' (Kostrikis L.G. et ai. Science 279:1228-1229, (1998)), hairpin-shaped, 

30 single-stranded synthetic oligonucleotides containing probe sequences which are 
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complementary to the nucleic acid of the present invention, may also be used to 
detect point mutations or other sequence changes as well as monitor expression 
levels of variant product. Such diagnostics would be particularly useful for 
prenatal testing. 

5 Another method for detecting mutations uses two DNA probes which are 

designed to hybridize to adjacent regions of a target, with abutting bases, where 
the region of known or suspected mutation(s) is at or near the abutting bases. 
The two probes may be joined at the abutting bases, e.g., in the presence of a 
ligase enzyme, but only if both probes are correctly base paired in the region of 

10 probe junction. The presence or absence of mutations is then detectable by the 
presence or absence of ligated probe. 

Also suitable for detecting mutations in the variant product coding 
sequence are oligonucleotide array methods based on sequencing by 
hybridization (SBH), as described, for example, in U.S. Patent No. 5,547,839. In 

15 a typical method, the DNA target analyte is hybridized with an array of 
oligonucleotides formed on a microchip. The sequence of the target can then be 
"read" from the pattem of target binding to the array. 

D. Gene mapping utilizing nucleic acid sequences 

20 The nucleic acid sequences of the present invention are also valuable for 

chromosome identification. The sequence is specifically targeted to and can 
hybridize with a particular location on an individual human chromosome. 
Moreover, there is a current need for identifying particular sites on the 
chromosome. Few chromosome marking reagents based on actual sequence data 

25 (repeat polymorphisms) are presently available for marking chromosomal 
location. The mapping of DNAs to chromosomes according to the present 
invention is an important first step in correlating those sequences with genes 
associated with disease. 

Briefly, sequences can be mapped to chromosomes by preparing PGR 

30 primers (preferably 20-30 bp) from the variant cDNA. Computer analysis of the 
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3' untranslated region is used to rapidly select primers that do not span more than 
one exon in the genomic DNA, which would complicate the amplification 
process. These primers are then used for PGR screening of somatic cell hybrids 
containing individual human chromosomes. Only those hybrids containing the 
5 human gene corresponding to the primer will yield an amplified fragment. 

PGR mapping of somatic cell hybrids or using instead radiation hybrids 
are rapid procedures for assigning a particular DNA to a particular chromosome. 
Using the present invention with the same oligonucleotide primers, 
sublocalization can be achieved with panels of fragments from specific 

10 chromosomes or pools of large genomic clones in an analogous manner. Other 
mapping strategies that can similarly be used to map to its chromosome include in 
situ hybridization, prescreening with labeled flow-sorted chromosomes and 
preselection by hybridization to construct chromosome specific-cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a cDNA clone to a metaphase 

15 chromosomal spread can be used to provide a precise chromosomal location in 
one step. This technique can be used with cDNA as short as 50 or 60 bases. For a 
review of this technique, see Verma et al, Human Chromosomes: a Manual of 
Basic Techniques, (1988) Pergamon Press, New York. 

Once a sequence has been mapped to a precise chromosomal location, the 

20 physical position of the sequence on the chromosome can be correlated with 
genetic map data. Such data are found, for example, in the OMIM database 
(Center for Medical Genetics, Johns Hopkins University, Baltimore, MD and 
National Center for Biotechnology Information, National Library of Medicine, 
Bethesda, MD). The OMIM gene map presents the cytogenetic map location of 

25 disease genes and other expressed genes. The OMIM database provides 
information on diseases associated with the chromosomal location. Such 
associations include the results of linkage analysis mapped to this interval, and 
the correlation of translocations and other chromosomal aberrations in this area 
with the advent of polygenic diseases, such as cancer, in general and prostate 

30 cancer in particular. 
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E. Therapeutic applications of nucleic acid sequences 

Nucleic acid sequences of the invention may also be used for therapeutic 
purposes. Turning first to the second aspect of the invention (i.e. inhibition of 
5 expression of variant), expression of variant product may be modulated through 
antisense technology, which controls gene expression through hybridization of 
complementary nucleic acid sequences, i.e. antisense DNA or RNA, to the 
control, 5' or regulatory regions of the gene encoding variant product. For 
example, the 5* coding portion of the nucleic acid sequence sequence which 

1 0 codes for the product of the present invention is used to design an antisense 
oligonucleotide of from about 10 to 40 base pairs in length. Oligonucleotides 
derived from the transcription start site, e.g. between positions -10 and +10 from 
the start site, are preferred. An antisense DNA oligonucleotide is designed to be 
complementary to a region of the nucleic acid sequence involved in transcription 

15 (Lee et al, Nucl Acids, Res., 6:3073, (1979); Cooney et al.. Science 241:456, 
(1988); and Dervan et al, Science 251:1360, (1991)), thereby preventing 
transcription and the production of the variant products. An antisense RNA 
oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the 
mRNA molecule into the variant products (Okano J, Neurochem, 56:560, 

20 (1991)). The antisense constructs can be delivered to cells by procedures known 
in the art such that the antisense RNA or DNA may be expressed in vivo. The 
antisense may be antisense mRNA or DNA sequence capable of coding such 
antisense mRNA. The antisense mRNA or the DNA coding thereof can be 
complementary to the full sequence of nucleic acid sequences coding for the 

25 variant protein or to a fragment of such a sequence which is sufficient to inhibit 
production of a protein product. 

Turning now to the first aspect of the invention, i.e. expression of variant, 
expression of variant product may be increased by providing coding sequences 
for coding for said product under the control of suitable control elements ending 

30 its expression in the desired host. 
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The nucleic acid sequences of the invention may be employed in 
combination with a suitable phamiaceutical carrier. Such compositions comprise 
a therapeutically effective amount of the compound, and a pharmaceutically 
acceptable carrier or excipient. Such a carrier includes but is not limited to saline, 
5 buffered saline, dextrose, water, glycerol, ethanol, and combinations thereof. The 
formulation should suit the mode of administration. 

The products of the invention as well as any activators and deactivators 
compounds (see below) which are polypeptides, may also be employed in 
accordance with the present invention by expression of such polypeptides in v/vo, 

10 which is often referred to as *'gene therapy." Cells from a patient may be 
engineered with a nucleic acid sequence (DNA or RNA) encoding a polypeptide 
ex vivo, with the engineered cells then being provided to a patient to be treated 
with the polypeptide. Such methods are well-known in the art. For example, cells 
may be engineered by procedures known in the art by use of a retroviral particle 

15 containing RNA encoding a polypeptide of the present invention. 

Similarly, cells may be engineered in vivo for expression of a polypeptide 
in vivo by procedures known in the art. As known in the art, a producer cell for 
producing a retroviral particle containing RNA encoding the polypeptide of the 
present invention may be administered to a patient for engineering cells in vivo 

20 and expression of the polypeptide in vivo. These and other methods for 
administering a product of the present invention by such method should be 
apparent to those skilled in the art from the teachings of the present invention. 
For example, the expression vehicle for engineering cells may be other than a 
retrovirus, for example, an adenovirus which may be used to engineer cells in 

25 vivo after combination with a suitable delivery vehicle. 

Retroviruses from which the retroviral plasmid vectors mentioned above 
may be derived include, but are not limited to, Moloney Murine Leukemia Virus, 
spleen necrosis virus, retroviruses such as Rous Sarcoma Virus, Harvey Sarcoma 
Virus, avian leukosis virus, gibbon ape leukemia virus, human immunodeficiency 

30 virus, adenovirus, Myeloproliferative Sarcoma Virus, and mammary tumor vims. 

9- 
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The retroviral plasmid vector is employed to transduce packaging cell 
lines to form producer cell lines. Examples of packaging cells which may be 
transfected include, but are not limited to, the PE501, PAS 17, psi'2, psi-AM, 
PA12, T19'14X, VT-J9-I7-H2, psi-CRE, psi-CRIP, GP-^E-86, GP^envAmI2, 
5 and DAN cell lines as described in Miller (Human Gene Therapy, Vol. 1, 
pg. 5-14, (1990)). The vector may transduce the packaging cells through any 
means know^n in the art. Such means include, but are not limited to, 
electroporation, the use of liposomes, and CaP04 precipitation. In one alternative, 
the retroviral plasmid vector may be encapsulated into a liposome, or coupled to a 
10 lipid, and then administered to a host. 

The producer cell line generates infectious retroviral vector particles 
which include the nucleic acid sequence(s) encoding the polypeptides. Such 
retroviral vector particles then may be employed, to transduce eukaryotic cells, 
either in vitro or in vivo. The transduced eukaryotic cells will express the nucleic 
15 acid sequence(s) encoding the polypeptide. Eukaryotic cells which may be 
transduced include, but are not limited to, embryonic stem cells, embryonic 
carcinoma cells, as well as hematopoietic stem cells, hepatocytes, fibroblasts, 
myoblasts, keratinocytes, endothelial cells, and bronchial epithelial cells. 

The genes introduced into cells may be placed under the control of 
20 inducible promoters, such as the radiation-inducible Egr-1 promoter, (Maceri, 
H.J., et al. Cancer Res,, 56(19):4311 (1996)), to stimulate variant production or 
antisense inhibition in response to radiation, eg., radiation therapy for treating 
tumors. 

25 Example III. Variant product 

The substantially purified variant product of the invention has been 
defined above as the product coded from the nucleic acid sequence of the 
invention. Preferably the amino acid sequence is an amino acid sequence having 
at least 90% identity to any one of the sequences identified as SEQ ID NO: 37 to 
30 SEQ ID NO: 72 provided that the amino acid sequence is not identical to that of 
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the original sequence from which it has been varied. The protein or polypeptide 
may be in mature and/or modified fonn, also as defined above. Also 
contemplated are protein fragments having at Teast 10 contiguous amino acid 
residues, preferably at least 10-20 residues, derived from the variant product, as 
5 well as homologues as explained above. 

The sequence variations are preferably those that are considered conserved 
substitutions, as defined above. Thus, for example, a protein with a sequence 
having at least 90% sequence identity with any of the products identified as SEQ 
ID NO: 37 to SEQ ID NO: 72, preferably by utilizing conserved substitutions as 

10 defined above is also part of the invention, and provided that it is not identical to 
the original peptide from which it has been varied. In a more specific 
embodiment, the protein has or contains any one of the sequence identified as 
SEQ ID NO: 37 to SEQ ID NO: 72. The variant product may be (i) one in which 
one or more of the amino acid residues in a sequence listed above are substituted 

15 with a conserved or non-conserved amino acid residue (preferably a conserved 
amino acid residue), or (ii) one in which one or more of the amino acid residues 
includes a substituent group, or (iii) one in which the variant product is fused 
with another compound, such as a compound to increase the half-life of the 
protein (for example, polyethylene glycol (PEG)), or a moiety which serves as 

20 targeting means to direct the protein to its target tissue or target cell population 
(such as an antibody), or (iv) one in which additional amino acids are fused to the 
variant product. Such fragments, variants and derivatives are deemed to be within 
the scope of those skilled in the art from the teachings herein. 

25 A. Preparation of variant product 

Recombinant methods for producing and isolating the variant product, and 
fragments of the protein are described above. 

In addition to recombinant production, fragments and portions of variant 
product may be produced by direct peptide synthesis using solid-phase techniques 
30 (cf. Stewart et aL, (1969) Solid-Phase Peptide Synthesis, WH Freeman Co, San 
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Francisco; Merrifield J., J. Am. Chem. Soc, 85:2149-2154, (1963)). In vitro 
peptide synthesis may be performed using manual techniques or by automation. 
Automated , synthesis may be achieved, for example, using AppUed Biosystems 
431 A Peptide Synthesizer (Perkin Elmer, Foster City, Calif) in accordance with 
5 the instructions provided by the manufacturer. Fragments of variant product may 
be chemically synthesized separately and combined using chemical methods to 
produce the full length molecule. 

II. Therapeutic uses and compositions utilizing the variant product 

\o 

The variant product of the invention is generally useful in treating diseases 
and disorders which are characterized by a lower than normal level of variant 
expression, and or diseases which can be cured or ameliorated by raising the level 
of the variant product, even if the level is normal. 

15 Variant products or fragments may be administered by any of a number of 

routes and methods designed to provide a consistent and predictable 
concentration of compound at the target organ or tissue. The product-containing 
compositions may be administered alone or in combination with other agents, 
such as stabilizing compounds, and/or in combination with other pharmaceutical 

20 agents such as drugs or hormones. 

Variant product-containing compositions may be administered by a 
number of routes including, but not limited to oral, intravenous, intramuscular, 
transdermal, subcutaneous, topical, sublingual, or rectal means as well as by nasal 
application. Variant product-containing compositions may also be administered 

25 via liposomes. Such administration routes and appropriate formulations are 
generally known to those of skill in the aft. 

The product can be given via intravenous or intraperitoneal injection. 
Similarly, the product may be injected to other localized regions of the body. The 
product may also be administered via nasal insufflation. Enteral administration is 

30 also possible. For such administration, the product should be formulated into an 
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appropriate capsule or elixir for oral administration, or into a suppository for 
rectal administration. 

The foregoing exemplary administration modes will likely require that the 
product be formulated into an appropriate carrier, including ointments, gels, 
5 suppositories. Appropriate formulations are well known to persons skilled in the 
art. 

Dosage of the product will vary, depending upon the potency and 
therapeutic index of the particular polypeptide selected. 

A therapeutic composition for use in the treatment method can include the 
10 product in a sterile injectable solution, the polypeptide in an oral delivery vehicle, 
the product in an aerosol suitable for nasal administration, or the product in a 
nebulized form, all prepared according to well known methods. Such 
compositions comprise a therapeutically effective amount of the compound, and a 
pharmaceutically acceptable carrier or excipient. Such a carrier includes but is not 
15 limited to saline, buffered saline, dextrose, water, glycerol, ethanol, and 
combinations thereof. The product of the invention may also be used to modulate 
endothelial differentiation and proliferation as well as to modulate apoptosis 
either ex vivo or in vitro, for example, in cell cultures. 

20 Example IV. Screening methods for activators and deactivators (inhibitors) 

The present invention also includes an assay for identifying molecules, 
such as synthetic drugs, antibodies, peptides, or other molecules, which have a 
modulating effect on the activity of the variant product, e.g. activators or 

25 deactivators of the variant product of the present invention. Such an assay 
comprises the steps of providing an variant product encoded by the nucleic acid 
sequences of the present invention, contacting the variant protein with one or 
more candidate molecules to determine the candidate molecules modulating 
effect on the activity of the variant product, and selecting from the molecules a 

30 cemdidate's molecule capable of modulating variant product physiological 
activity. 
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The variant product, its catalytic or immunogenic fragments or 
oligopeptides thereof, can be used for screening therapeutic compounds in any of 
a variety of drug screening techniques. The fragment employed in such a test 
may be free in solution, affixed to a solid support, borne on a cell membrane or 
5 located intracellularly. The formation of binding complexes, between variant 
product and the agent being tested, may be measured. Alternatively, the activator 
or deactivator may work by serving as agonist or antagonist, respectively, of the 
variant receptor, binding entity or target site, and their effect may be determined 
in connection with any of the above. 

10 Another technique for drug screening which may be used provides for 

high throughput screening of compounds having suitable binding affinity to the 
variant product is described in detail by Geysen in PCT Application WO 
84/03564, published on Sep. 13, 1984. In summary, large numbers of different 
small peptide test compounds are synthesized on a solid substrate, such as plastic 

15 pins or some other surface. The peptide test compounds are reacted with the full 
variant product or with fragments of variant product and washed. Bound variant 
product is then detected by methods well known in the art. Substantially purified 
variant product can also be coated directly onto plates for use in the 
aforementioned drug screening techniques. Alternatively, non-neutralizing 

20 antibodies can be used to capture the peptide and immobilize it on a solid 
support. 

Antibodies to the variant product, as described in Example VI below, may 
also be used in screening assays according to methods well known in the art. For 
example, a ''sandwich'' assay may be performed, in which an anti-variant 
25 antibody is affixed to a solid surface such as a microtiter plate and variant 
product is added. Such an assay can be used to capture compounds which bind to 
the variant product. Alternatively, such an assay may be used to measure the 
ability of compounds to influence with the binding of variant product to the 
variant receptor, and then select those compounds which effect the binding. 

30 
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Example V. Anti-variant antibodies 
A. Synthesis 

In still another aspect of the invention, the purified variant product is used 
to produce anti-variant antibodies which have diagnostic and therapeutic uses 
5 related to the activity, distribution, and expression of the variant product. 

Antibodies to the variant product may be generated by methods well 
known in the art. Such antibodies may include, but are not limited to, polyclonal, 
monoclonal, chimeric, humanized, single chain. Fab fragments and fragments 
produced by an Fab expression library. Antibodies, i.e., those which inhibit 

!0 dimer fonnation, are especially preferred for therapeutic use. 

A fragment of the variant product for antibody induction does not require 
biological activity but have to feature immunological activity; however, the 
protein fragment or oligopeptide must be antigenic. Peptides used to induce 
specific antibodies may have an amino acid sequence consisting of at least five 

15 amino acids, preferably at least 10 amino acids of the sequences specified in any 
one of SEQ ID NO: 37 to SEQ ID NO: 72. Preferably they should mimic a 
portion of the amino acid sequence of the natural protein and may contain the 
entire amino acid sequence of a small, naturally occurring molecule. Short 
stretches of variant protein amino acids may be fused with those of another 

20 protein such as keyhole limpet hemocyanin and antibody produced against the 
chimeric molecule. Procedures well known in the art can be used for the 
production of antibodies to variant product. 

For the production of antibodies, various hosts including goats, rabbits, 
rats, mice, etc may be immunized by injection with variant product or any 

25 portion, fragment or oligopeptide which retains inmiunogenic properties. 
Depending on the host species, various adjuvants may be used to increase 
immunological response. Such adjuvants include but are not limited to Freund's, 
mineral gels such as aluminum hydroxide, and surface active substances such as 
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet 
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hemocyanin, and dinitrophenol. BCG (bacilli Calmette-Guerin) and 
Corynebacterium parvum are potentially useful human adjuvants. 

Monoclonal antibodies to variant protein may be prepared using any 
technique which provides for the production of antibody molecules by continuous 
5 cell lines in culture. These include but are not limited to the hybridoma technique 
originally described by Koehler and Milstem {Nature 256:495-497, (1975)), the 
human B-cell hybridoma technique (Kosbor et al, Immunol Today 4:72, (1983); 
Cote etal, Proc. Natl Acad. ScL 80:2026-2030, (1983)) and the EBV-hybridoma 
technique (Cole, et al, Mol Cell Biol 62:109-120, (1984)). 

10 Techniques developed for the production of "chimeric antibodies", the 

splicing of mouse antibody genes to human antibody genes to obtain a molecule 
with appropriate antigen specificity and biological activity can also be used 
(Morrison et al, Proc. Natl Acad. Sci, 81:6851-6855, (1984); Neuberger et al, 
Nature 312:604-608, (1984); Takeda et al, Nature 314:452-454, (1985)). 

15 Alternatively, techniques described for the production of single chain antibodies 
(U.S. Pat. No, 4,946,778) can be adapted to produce single-chain antibodies 
specific for the variant protein. 

Antibodies may also be produced by inducing in vivo production in the 
lymphocyte population or by screening recombinant immunoglobulin libraries or 

20 panels of highly specific binding reagents as disclosed in Orlandi et aL {Proc, 
Natl Acad. Sci. 86:3833-3837, 1989)), and Winter G and Milstein C, {Nature 
349:293-299,(1991)). 

Antibody fragments which contain specific binding sites for variant 
protein may also be generated. For example, such fragments include, but are not 

25 limited to, the F(ab')2 fragments which can be produced by pepsin digestion of 
the antibody molecule and the Fab fragments which can be generated by reducing 
the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression 
libraries may be constructed to allow rapid and easy identification of monoclonal 
Fab fragments with the desired specificity (Huse W.D. et al, Science 

30 256:1275-1281,(1989)). 
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B. Diagnostic applications of antibodies 

A variety of protocols for competitive binding or immunoradiometric 
assays using either polyclonal or monoclonal antibodies with established 
5 specificities are well known in the art. Such immunoassays typically involve the 
formation of complexes between the variant product and its specific antibody and 
the measurement of complex formation. A two-site, monoclonal-based 
immunoassay utilizing monoclonal antibodies reactive to two noninterfering 
epitopes on a specific variant product is preferred, but a competitive binding 

10 assay may also be employed. These assays are described in Maddox D.E., et al, 
{J. Exp, Med. 158:1211, (1983)). 

Antibodies which specifically bind variant product are useful for the 
diagnosis of conditions or diseases characterized by expression of the novel 
variant of the invention (where normally it is not expressed) by over or under 

1 5 expression of variant as well as for detection of diseases in which the proportion 
between the amount of the variants of the invention and the original sequence 
from which it varied is altered. Alternatively, such antibodies may be used in 
assays to monitor patients being treated with variant product, its activators, or its 
deactivators. Diagnostic assays for variant protein include methods utilizing the 

20 antibody and a label to detect variant product in human body fluids or extracts of 
cells or tissues. The products and antibodies of the present invention may be 
used with or without modification. Frequently, the proteins and antibodies will be 
labeled by joining them, either covalently or noncovalently, with a reporter 
molecule, A wide variety of reporter molecules are known in the art. 

25 A variety of protocols for measuring the variant product, using either 

polyclonal or monoclonal antibodies specific for the respective protein are known 
in the art. Examples include enzyme-linked immunosorbent assay (ELISA), 
radioimmunoassay (RIA), and fluorescent activated cell sorting (FACS). As 
noted above, a two-site, monoclonal-based immunoassay utilizing monoclonal 

30 antibodies reactive to two non-interfering epitopes on variant product is 
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preferred, but a competitive binding assay may be employed. These assays are 
described, among other places, in Maddox, et al. (supra). Such protocols provide 
a basis for diagnosing altered or abnormal levels of variant product expression. 
Normal or standard values for yariant product expression are established by 
5 combining body fluids or cell extracts taken from normal subjects, preferably 
human, with antibody to variant product under conditions suitable for complex 
formation which are well known in the art. The amount of standard complex 
formation may be quantified by various methods, preferably by photometric 
methods. Then, standard values obtained from normal samples may be compared 
10 with values obtained from samples from subjects potentially affected by disease. 
Deviation between standard and subject values establishes the presence of disease 
state. 

The antibody assays are useful to determine the level of variant product 
present in a body fluid sample, in order to determine whether it is being 
15 expressed at all, whether it is being overexpressed or underexpressed in the 
tissue, or as an indication of how variant levels of variable products are 
responding to drug treatment. 

By another aspect the invention concerns methods for determining the 
presence or level of various anti-variant antibodies in a biological sample 
20 obtained from patients, such as blood or serum sample using as an antigen the 
variant product. Determination of said antibodies may be indicative to a plurality 
of pathological conditions or diseases. 

C. Therapeutic uses of antibodies 

25 In addition to their diagnostic use the antibodies may have a therapeutical 

utility in blocking or decreasing the activity of the variant product in pathological 
conditions where beneficial effect can be achieved by such a decrease. 

The antibody employed is preferably a humanized monoclonal antibody, 
or a human Mab produced by known globulin-gene library methods. The 

30 antibody is administered typically as a sterile solution by IV injection, although 
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other parenteral routes may be suitable. Typically, the antibody is administered 
in an amount between about 1-15 mg/kg body weight of the subject. Treatment is 
continued, e.g., with dosing every 1-7 days, until a therapeutic improvement is 
seen. 

5 Although the invention has been described with reference to specific 

methods and embodiments, it is appreciated that various modifications and 
changes may be made without departing from the invention. 
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CLAIMS: 

1. An isolated nucleic acid sequence, of an alternative splicing variant, selected 
from the group consisting of: 

(i) the nucleic acid sequence depicted in any one of SEQ ID NO: 1 to 
5 SEQ ID NO: 36; 

(ii) nucleic acid sequences having at least 90% identity with the 
sequence of (i) with the proviso that each sequence is different than the original 
nucleic acid sequence from which the sequences of (i) have been varied by 
altemative splicing; and 

10 (iii) fragments of (i) or (ii) of at least 20 b.p., provided that said fragment 

contains a sequence which is not present, as a continuous stretch of nucleotides, in 
the original nucleic acid sequence from which the sequences of(i) have been 
varied by altemative splicing. 

2. An isolated nucleic acid sequence complementary to the nucleic acid 
1 5 sequence of Claim 1 . 

3. An amino acid sequence selected from the group consisting of: 

(i) an amino acid sequence coded by the isolated nucleic acid sequence 
of altemative splice variants of Claim 1 ; 

(^) homologues of the amino acid sequences of (i) in which one or more 
20 amino acids has been added, deleted, replaced or chemically 

modified in the region or adjacent to the region where the amino 
acid sequences differs from the original amino acid sequence, coded 
by the original nucleic acid sequence from which the variant has been 
varied. 

25 4. An amino acid sequence according to Claim 3, as depicted in any one of 
SEQ ID NO: 37 to SEQ ID NO: 72. 

5. An isolated nucleic acid sequence coding for any one of the amino acid 
sequences of Claim 3 or 4. 

6. A purified antibody which binds specifically to any of the amino acid 
30 sequence of Claim 3 or 4. 
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7. An expression vector comprising any one of the nucleic acid sequences of 
Claim 1 or 5 and control elements for the expression of the nucleic acid sequence in 
a suitable host. 

8. An expression vector comprising any one of the nucleic acid sequences of 
5 Claim 2, and control elements for the expression of the nucleic acid sequences in a 

suitable host. 

9. A host cell transfected by the expression vector of Claim 7 or 8. 

10- A ph£Lrmaceutical composition comprising a pharmaceutically acceptable 
carrier and as an active ingredient an agent selected from the group consisting of: 

1 0 (i) the expression vector of Claim 7; and 

(ii) any one of the amino acid sequences of Claim 3 or 4. 
11. A pharmaceutical composition according to Claim 10, for treatment of 
diseases which can be ameliorated or cured by raising the level of any one of the 
amino acid sequences depicted in SEQ ID NO: 37 to SEQ ID NO: 72. 

15 12. A pharmaceutical composition comprising a pharmaceutically acceptable 
carrier and as an active ingredient an agent selected from the group consisting of: 

(i) any one of the nucleic acid sequences of Claim 2; 

(ii) the expression vector of Claim 8; and 

(iii) the purified antibody of Claim 6. 

20 13. A pharmaceutical composition according to Claim 12, for treatment of 
diseases which can be ameliorated or cured by decreasing the level of any one of 
the amino acid sequences depicted in SEQ ID NO: 37 to SEQ ID NO: 72. 
14. A method for detecting an variant nucleic acid sequence in a biological 
sample, comprising the steps of: 

25 (a) hybridizing to nucleic acid material of said biological sample any one 

of the nucleic acid sequences of Claim 1 or 2; and 
(b) detecting said hybridization complex; 

wherein the presence of said hybridization complex correlates with the 
presence of an variant nucleic acid sequence in the said biological sample. 
30 15. A method for determining the level of variant nucleic acid sequences in a 
biological sample comprising the steps of: 

I" 
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(a) hybridizing to nucleic acid material of said biological sample any one 
of the nucleic acid sequences of Claim 1 or 2; and 

(b) determining the amount of hybridization complexes and normalizing 
said amount to provide the level of the variant nucleic acid sequences in the 

5 sample. 

16. A method for determining the ratio between the level of variant of the 
nucleic acid sequence in a first biological sample and the level of the original 
sequence from which the variant has been varied by alternative splicing in a second 
biological sample comprising: 
1 0 (i) determining the level of the variant nucleic acid sequence in the first 

biological sample according to the method of Claim 15; . 

(ii) detennining the level of the original sequence in the second biological 
sample; and 

(iii) comprising the levels obtained in (a) and (b) to give said ratio. 

15 17. A method according to Claim 16, wherein said first and said second 
biological samples are the same sample. 

18. A method according to any of Claims 14 to 17, wherein the nucleic acid 
material of said biological sample are mRNA transcripts. 

19. A method according to Claim 18, where the nucleic acid sequence is present 
20 in a nucleic acid chip. 

20. A method for identifying candidate compounds capable of binding to the 
variant product and modulating its activity the method comprising: 

(i) providing any one of the amino acid sequences as defined in Claim 3 

or 4; 

25 (ii) contacting a candidate compound with said amino acid sequence; 

(iii) determining the effect of said candidate compound on the biological 
activity of said protein or polypeptide and selecting those compounds which show a 
significant effect on said biological activity. 

21. A method according to Claim 20, wherein the compound is an activator and 
30 the measured effect is increase in the biological activity. 
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22. A method according to Claim 20, wherein the compound is an deactivator 
and the effect is decrease in the biological activity. 

23. An activator of any one of the amino acid sequences of Claim 3 or 4. 

24. An deactivator of any one of the amino acid sequences of Claims 3 or 4. 

5 25. A method for detecting any one of the amino acid sequences of Claim 3 or 4 
in a biological sample, comprising the steps of: 

(a) contacting with said biological sample the antibody of Claim 8, 
thereby foraiing an antibody-antigen complex; and 

(b) detecting said antibody-antigen complex 

10 wherein the presence of said antibody-antigen complex correlates with the 

presence of the desired amino acid in said biological sample. 

26- A method for detecting the level of the amino acid sequence of any one of 
Claim 3 or 4 in a biological sample, comprising the steps of; 

(a) contacting with said biological sample the antibody of Claim 8, 
15 thereby forming an antibody-antigen complex; and 

(b) detecting the amount of said antibody-antigen complex and 
normalizing said amount to provide the level of said amino acid sequence in the 
sample. 

27. A method for determining the ratio between the level of any one of the 
20 amino acid sequences of Claims 3 or 4 present in a first biological sample and the 

level of the original amino acid sequences from which they were varied by 
alternative splicing, present in a second biological sample, the method comprising: 
(i) determining the level of the amino acid sequences of Claims 3 or 4 
into a first sample by the method of Claim 26; 
25 (ii) determining the level of the original amino acid sequence in the 

second sample; and 

(iii) comparing the level obtained in (a) and (b) to give said ratio. 

28. A method according to Claim 27, wherein said first and said second 
biological samples are the same sample. 

30 29. A method for detecting any one of the antibodies of Claim 6 in a biological 
sample comprising the steps of: 

,v- 

I 
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(a) contacting said biological sample with any one of the amino acid 
sequences of Claim 3 or 4 thereby forming an antibody-antigen complex; and 

(b) detecting said antibody-antigen complex 

wherein the presence of said antibody-antigen complex correlates with the 
5 presence of the antibody in said biological sample. 

30. A method for detecting the level of any one of the antibodies of Claim 6 in a 
biological sample comprising the steps of: 

(i) contacting said biological sample with any one of the amino acid 
sequences of Claim 3; 
10 (ii) detecting the amount of said antibody-antigen complex and 

normalizing said amount to provide the levels of said antibody in the sample. 

For the Applicants, 

REINHOLD COHN AND PARTNERS 

By: 
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SEQUENCE LISTING 



<110> COMPUGEN LTD. 

<120> SPLICE VARIANTS OF ONCOGENES 

<130> 1245448-COMPUGEN 

<140> 
<141> 

<160> 72 

<170> PatentIn Ver. 2.1 



<210> 1 

<211> 1735 

<212> DNA 

<213> Homo sapiens 



<400> 1 

gtccaccgga 

ctgaggaccg 

gtgggcttta 

aaggtgataa 

ttagtcctgt 

aacatcacag 

agcaaaatcc 

gatgtgattc 

actgacctca 

aaatgtgaca 

ggggacagag 

ggaattaccg 

gaaggtgaag 

gtcttcaaga 

aatttgtcta 

tggacagttg 

tgcttaaatg 

tatggaagtg 

gaggggaagt 

ttcatgggaa 

gcactttcct 

gcttatgaca 

tatgagtgtt 

tcatccttga 

gctttgattg 

ttgtttccaa 

gaaggatgaa 

attggccagt 

gaagaataac 



agcgagttgc 
gcaacatggt 
ccatgagtaa 
ccatgtttgt 
ttggtacaga 
tgcacagaca 
aaccaggttc 
aacatgaaac 
gcagccgatt 
tctccctgca 
gagatggccc 
aacagcaaaa 
atgggttgga 
aaattgagag 
taaggattgc 
tggatgcaaa 
atgatgatga 
atatagttcc 
gcttctctgt 
atcaagttct 
ccctgattca 
aaagagctaa 
tagtgtatgt 
aaaacagtaa 
actccatgag 
ccaccaaaat 
caagtcatat 
cctaaataaa 
atcttcctct 



gacacggcag 
gcggtcgggg 
ctccattcct 
acagcgacag 
tggcactgac 
tctgatgcta 
tcaacaggct 
aataggaaag 
cagcaaaagt 
attcttcttg 
ctttcgctta 
agaaggtctt 
tgaaatttat 
gcattccatt 
agcctataaa 
aaccctaaaa 
aactgaagtt 
tttctctaaa 
tttgggattt 
aaaggtcttt 
tgctttggat 
tcctcaagtc 
gcagctgcct; 
gaaatatgct 
cttggcaaag 
cccaaatcct 
ttcttttaaa 
tgtctttttt 
cctacttcag 



gttcccgccc 
aataaggcag 
ggtatagaat 
gtgtttgctg 
aatccccttt 
ccagattttg 
gacttcctgg 
aagtttgaga 
cagctggata 
cctttctcac 
ggtggccatg 
gagatagtga 
tcattcagtg 
cactggccct 
tcgattctac 
aaagaagata 
ttaaaagagg 
gtggatgagg 
tgtaaatctt 
gcagcaagag 
gacttagaca 
ggcgtggctt 
ttcatggaag 
cccaccgagg 
aaagatgaga 
cgatttcaga 
tgaaagagag 
ctctgaaggc 
aaggtaaaaa 



ggaagaagcg 
ctgttgtgct 
ccccatttga 
agaacaagga 
ctggtgggga 
atttgctgga 
atgcactaat. 
agaggcatat 
ttataattca 
ttggcaagga 
ggccttcctt 
aaatggtgat 
agagtctgag 
gccgactgac 
aggagagagt 
tacaaaaaga 
atattattca 
aacaaatgaa 
ctcaggttca 
atgatgaggc 
tggtggccat 
ttcctcatat 
acttgcggca 
cacagttgaa 
agacagacac 
gattatttca 
ctaagtgcaa 
cccatttgct 
tattctgagc 



accaaagcgc 
gtgtatggac 
acaagcaaag 
tgagattgct 
tcagtatcag 
ggacattgaa 
cgtgagcatg 
tgaaatattc 
tagcttgaag 
agatggaagt 
tccactaaaa 
gatatcttta 
aaaactgtgc 
cattggctcc 
taaaaagact 
aacagtttat 
agggttccgc 
atataaatcg 
gagaagattc 
agctgcagtt 
agttcgatat 
caagcataac 
atacatgttt 
tgctgttgat 
ccttgaagac 
ggtaagagaa 
agttgcggta 
ctatttaatg 
tctta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1735 



<210> 2 

<211> 2766 

<212> DNA 

<213> Homo sapiens 

<400> 2 

gtccaccgga agcgagttgc gacacggcag gttcccgccc ggaagaagcg accaaagcgc 60 
ctgaggaccg gcaacatggt gcggtcgggg aataaggcag ctgttgtgct gtgtatggac 120 
gtgggcttta ccatgagtaa ctccattcct ggtatagaat ccccatttga acaagcaaag 180 



1 



aaggtgataa 
ttagtcctgt 
aacatcacag 
agcaaaatcc 
gatgtgattc 
actgacctca 
aaatgtgaca 
ggggacagag 
ggaattaccg 
gaaggtgaag 
gtcttcaaga 
aatttgtcta 
tggacagttg 
tgcttaaatg 
cagattcctc 
caagtgactg 
ttaaagactg 
acctctgttg 
gccagctttg 
aatgaaacac 
attaagtttt 
gaaattaaac 
accaaagagg 
cccaaagaca 
gatttattgg 
tgtgatgctg 
ctcaagaaat 
acacacatat 
ttattttctg 
catgctttga 
cagcccttgt 
cttaaccctt 
gagcaagcag 
aggacggact 
gtatataggt 
cttttaagca 
tgccatatca 
ttatatcact 
aaacacttag 
ggatggtgtc 
tcctcattct 
cacctctgtc 
ttaaaggcat 
atgtgt 



ccatgtttgt 
ttggtacaga 
tgcacagaca 
aaccaggttc 
aacatgaaac 
gcagccgatt 
tctccctgca 
gagatggccc 
aacagcaaaa 
atgggttgga 
aaattgagag 
taaggattgc 
tggatgcaaa 
atgatgatga 
tctctaaaat 
ctcaggaaat 
agcaaggggg 
gaagtgtgaa 
aggaagcgag 
cgtattttat 
cagaagagca 
aattaaatca 
aagcctctgg 
aaccaagtgg 
acatgatata 
ggagttctaa 
tcccagcagg 
atattacaag 
gttggtgtat 
agtttctgga 
gatgtgatta 
tccagagtcc 
tagcatttgc 
cacttctctt 
ggttttcttt 
gtgagttatg 
gtgaaccttt 
tcactgttct 
gtaacttttc 
t cctttacta 
tgccttgagt 
ttcagcaccc 
aaatgactca 



acagcgacag 
tggcactgac 
tctgatgcta 
tcaacaggct 
aataggaaag 
cagcaaaagt 
attcttcttg 
ctttcgctta 
agaaggtctt 
tgaaatttat 
gcattccatt 
agcctataaa 
aaccctaaaa 
aactgaactg 
aaagaccctt 
tttccaagac 
agcccacttc 
tcctgctgaa 
taaccagctc 
gaagagcata 
gcgctttaac 
tttctgggaa 
aagttctgtc 
agacacagca 
ggtcgtggat 
caaaacaagt 
ttacctggag 
ggataattta 
tattttttct 
aagtagatct 
gtgtctcatg 
tcctttgcct 
ttcctcccaa 
tttagttgag 
aagtggggtg 
gtggtggtct 
agaaactcaa 
ctacttgcaa 
ggtctttccc 
aataagaaaa 
tccagttcct 
tcataagtcg 
tcaaaaaaaa 



gtgtttgctg 
aatccccttt 
ccagattttg 
gacttcctgg 
aagtttgaga 
cagctggata 
cctttctcac 
ggtggccatg 
gagatagtga 
tcattcagtg 
cactggccct 
tcgattctac 
aaagaagata 
aatcctcccg 
tttcctctga 
aaccatgaag 
agcgtctcca 
aacttccgtg 
ataaatcaca 
gactgcatcc 
aacttcctga 
attgttgtcc 
acagctgagg 
gctgtatttg 
gtatggggaa 
tggatgcggc 
gcggatcatc 
gaccccatac 
gtggtcttac 
tttcttgacc 
tggaaccatg 
gatcctccaa 
caagcagctg 
gccttctagt 
ggaaggggag 
catgaagaaa 
aaactgagaa 
gcctcaaaga 
atttctacct 
taacaaagcc 
ctttggtgta 
tcactaatac 
aaaaaaaaaa 



agaacaagga 
ctggtgggga- 
atttgctgga 
atgcactaat 
agaggcatat 
ttataattca 
ttggcaagga 
ggccttcctt 
aaatggtgat 
agagtctgag 
gccgactgac 
aggagagagt 
tacaaaaaga 
ctgaggtgac 
ttgaagccaa 
atggacctac 
gtctggctga 
ttctagtgaa 
tcgaacagtt 
gagccttccg 
aagcccttca 
aggatggaat 
aagccaaaaa 
aagaaggtgg 
tctaagagag 
cattcaaggg 
taattctctg 
aagtttataa 
tgatctttgt 
tagtatatca 
gcatggttat 
cagctgtcac 
ggttaggaaa 
taccacatta 
cacaatttcc 
agaccttttg 
atttactaca 
gagaaagttt 
aagtcagctt 
cttattctct 
cagacttctt 
acagttttgt 
aattatggta 



tgagattgct 
"tcagtatcag 
ggacattgaa 
cgtgagcatg 
tgaaatattc 
tagcttgaag 
agatggaagt 
tccactaaaa 
gatatcttta 
aaaactgtgc 
cattggctcc 
taaaaagact 
aacagtttat 
aacgaaaagt 
gaaaaaggat 
agctaaaaaa 
aggcagtgtc 
acagaagaag 
tttggatact 
ggaagaagcc 
agagaaagtg 
tactctgatc 
gtttctggcc 
tgatgtggac 
ctgccatcgc 
gagccaaaat 
tggaatgaat 
agagtcattg 
atattacata 
gtgacagttg 
tgatgagttt: 
aacttgtgtt 
accatgggta 
ctctgcctct 
cttcatactc 
gcccaatctc 
gtagttagaa 
cgttatatta 
tcatctttgt 
ttttttcttg 
ggtacccagt 
acatgtaaca 
cctcgaatca 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2766 



<210> 3 

<211> 2034 

<212> DNA 

<213> Homo sapiens 



<400> 3 

ggagacaggt 

gccaggagga 

gtggctgcag 

gccattatcc 

aggtgcggga 

acaacctggt 

agaaggggga 

tgaccacggg 

agcccgaacc 



ggtggctacg 
ccatgtgaat 
ctcacacccg 
catagtccca 
cccactggtt 
tatcgctctg 
acagctccgc 
ccaggaaggc 
ctggtt cttc 



acggcraagg 
ggggccagag 
gaagatgact 
ctggatggca 
acctacgaag 
cacagctatg 
atcctggagc 
ttcatcccct 
aagaacctga 



gagctgagac 
ggctcccggg 
ggatggaaaa 
agggcacgct 
gctccaatcc 
agccctctca 
agagcggcga 
tcaattttgt 
gccgcaagga 



tgtccaggca 
ctgggcaggg 
catcgatgtg 
gctcatccga 
gccggctt cc 
cgacggagat 
gtggtggaag 
ggccaaagcg 
cgcggagcgg 



gccaggttag 
accatgggct 
tgtgagaact 
aatggctctg 
ccactgcaag 
ctgggctttg 
gcgcagtccc 
aacagcctgg 
cagctcctgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 



2 



cgcccgggaa 
tttcactgtc 
tccgtaatct 
atgaactggt 
cctgccagac 
agacgctgaa 
actacaacgg 
acgccttcct 
tctacgctgt 
gtctagtgga 
acatggcagc 
gtgaccttcg 
ttggcctagc 
atcctgctga 
gaggtgattc 
gagctgtacc 
gactacctgc 
cagccttgag 
gggagatgga 
aatcccaccc 
tggactctgc 
aaggtacccc 
gattgacaga 
agggccagga 
atcaggagtt 



cactcacggc 
ggtccgggac 
ggacaacggt 
ccgccattac 
ccagaagccc 
gctggtggag 
gcacacgaag 
ggccgaggcc 
ggtcacccag 
ttttctcaag 
ccaaattgca 
ggctgccaac 
acgcctcatt 
cggaaattgt 
agaacctgga 
aactcatgag 
gcagtgtgct 
aggccttgag 
gttcttgtgc 
acatgtgaca 
acatgtcttg 
tttctggctc 
agcttctgcc 
ctttatctaa 
caataaatgt 



tccttcctca 
ttcgaccaga 
ggcttctaca 
accaatgctt 
cagaagccgt 
cggctggggg 
gtggcggtga 
aacctcatga 
gagcccatct 
accccttcag 
gaaggcatgg 
attctggtgt 
gaggacattc 
cacccacggc 
gcgaggctac 
gctgtgctgg 
ggaggacttc 
aggccctggg 
catagtcaca 
catatgcacc 
tacatgtgta 
tcccatttcc 
cacctacttt 
tacctctgtg 
ctgttgatga 



tccgggagag 
accagggaga 
tctcccctcg 
cagatgggct 
ggtgggagga 
ctggacagtt 
agagcctgaa 
agcagctgca 
acatcatcac 
gcatcaagtt 
cattcattga 
ctgacaccct 
accatcaagt 
cgcatccctt 
cgcatggtgc 
aaggagcgcc 
ttcacggcca 
gttctccccc 
tggcctatgc 
ttgtgtctgt 
gcctgtgcat 
tgagaccaca 
tctttcctca 
tgctcctcct 
ctgttgtaca 



cgagagcacc 
ggtggtgaaa 
aatcactttt 
gtgcacacgg 
cgagtgggag 
cggggaggtg 
gcagggcagc 
acaccagcgg 
tgaatacatg 
gaccatcaac 
agagcggaat 
gagctgcaag 
cagatgtgtg 
acccagggat 
gccctgacaa 
cagaggaccg 
cagagggcca 
tttctctcca 
acatatggac 
acacgtgtcc 
gtatgtcttg 
gagagagggg 
gatcatccag 
tggtgcctgg 
caacaacaaa 



gcgggatcgt 
cattacaaga 
cccggcctgc 
ttgagccgcc 
gttcccaggg 
tggatggggt 
atgtccccgg 
ctggttcggc 
gagaatggga 
aaactcctgg 
tatattcatc 
attgcagact 
gtcttttggg 
gaccaacccg 
ctgtccagag 
gcccaccttt 
gtaccagcct 
gcctgacttg 
tctgcacatg 
tgtagttgcg 
gacactgtac 
agaagcctgg 
aagttcctca 
cctggcacac 
acaa 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
-1800 
1860 
1920 
1980 
2034 



<210> 4 

<211> 2282 

<212> DNA 

<213> Homo sapiens 

<400> 4 

ggagacaggt ggtggctacg acggcraagg gagctgagac tgtccaggca gccaggttag 60 

gccaggagga ccatgtgaat ggggccagag ggctcccggg ctgggcaggg accatgggct 120 

gtggctgcag ctcacacccg gaagatgact ggatggaaaa catcgatgtg tgtgagaact 180 

gccattatcc catagtccca ctggatggca agggcacgct gctcatccga aatggctctg 240 

aggtgcggga cccactggtt acctacgaag gctccaatcc gccggcttcc ccactgcaag 300 

gtgaccccag gcagcagggc ctgaaagaca aggcctgcgg atccctggct gttggcttcc 360 

acctctcccc cacctacttt ctccccggtc ttgccttcct tgtcccccac cctgtaactc 420 

caggcttcct gccgatccca gctcggttct ccctgacgcc ccttgtcttt acagacaacc 480 

tggttatcgc tctgcacagc tatgagccct ctcacgacgg agatctgggc tttgagaagg 540 

gggaacagct ccgcatcctg gagcagagcg gcgagtggtg gaaggcgcag tccctgacca 600 

cgggccagga aggcttcatc cccttcaatt ttgtggccaa agcgaacagc ctggagcccg 660 

aaccctggtt cttcaagaac ctgagccgca aggacgcgga gcggcagctc ctggcgcccg 720 

ggaacactca cggctccttc ctcatccggg agagcgagag caccgcggga tcgttttcac 780 

tgtcggtccg ggacttcgac cagaaccagg gagaggtggt gaaacattac aagatccgta 840 

atctggacaa cggtggcttc tacatctccc ctcgaatcac ttttcccggc ctgcatgaac 900 

tggtccgcca ttacaccaat gcttcagatg ggctgtgcac acggttgagc cgcccctgcc 960 

agacccagaa gccccagaag ccgtggtggg aggacgagtg ggaggttccc agggagacgc 1020 

tgaagctggt ggagcggctg ggggctggac agttcgggga ggtgtggatg gggtactaca 108 0 

acgggcacac gaaggtggcg gtgaagagcc tgaagcaggg cagcatgtcc ccggacgcct 1140 

tcctggccga ggccaacctc atgaagcagc tgcaacacca gcggctggtt cggctctacg 1200 

ctgtggtcac ccaggagccc atctacatca tcactgaata catggagaat gggagtctag 1260 

tggattttct caagacccct tcaggcatca agttgaccat caacaaactc ctggacatgg 1320 

cagcccaaat tgcagaaggc atggcattca ttgaagagcg gaattatatt catcgtgacc 1380 

ttcgggctgc caacattctg gtgtctgaca ccctgagctg caagattgca gactttggcc 1440 

tagcacgcct cattgaggac aacgagtaca cagccaggga gggggccaag tttcccatta 1500 

agtggacagc gccagaagcc attaactacg ggacattcac catcaagtca gatgtgtggt 1560 

cttttgggat cctgctgacg gaaattgtca cccacggccg catcccttac ccagggatga 1620 

ccaacccgga ggtgattcag aacctggagc gaggctaccg catggtgcgc cctgacaact 1680 



3 



gtccagagga 
ccacctttga 
accagcctca 
ctgacttggg 
tgcacatgaa 
tagttgcgtg 
cactgtacaa 
aagcctggga 
gttcctcaag 
tggcacacat 
aa 



gctgtaccaa 
ctacctgcgc 
gccttgagag 
gagatggagt 
tcccacccac 
gactctgcac 
ggtacccctt 
ttgacagaag 
ggccaggact 
caggagttca 



ctcatgaggc 
agtgtgctgg 
gccttgagag 
tcttgtgcca 
atgtgacaca 
atgtcttgta 
tctggctctc 
ctftctgccca 
ttatctaata 
ataaatgtct 



tgtgctggaa 
aggacttctt 
gccctggggt 
tagtcacatg 
tatgcacctt 
catgtgtagc 
ccatttcctg 
cctacttttc 
cctctgtgtg 
gttgatgact 



ggagcgccca 
cacggccaca 
tctccccctt 
gcctatgcac 
gtgtctgtac 
ctgtgcatgt 
agaccacaga 
tttcctcaga 
ctcctccttg 
gttgtacaca 



gaggaccggc 
gagggccagt 
tctctccagc 
atatggactc 
acgtgtcctg 
atgtcttgga 
gagaggggag 
tcatccagaa 
gtgcctggcc 
acaacaaaac 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2282 



<210> 5 

<211> 932 

<212> DNA 

<213> Homo sapiens 



<400> 5 

gaggcagcag 

ccagttgcag 

agtgatttgc 

tggaagttct 

aaaccctgac 

agaaaccaat 

catggatgat 

ctcgaacgac 

ccattctgat 

agttttcact 

ttatggactg 

aacagacaca 

tcattcattc 

tatgaccatt 

tctctagatt 

agattaaaat 



cagcaggagg 
ccttctcagc 
ttttgcctcc 
gaggaaaagc 
ccatctcaga 
gactttaaac 
atggatgatg 
tctgatgatg 
gaatctgatg 
ccagttgtcc 
aggtcaaaat 
cctgatggtt 
attcatccat 
gaatacaaat 
tgattctcaa 
aaatggaaaa 



aggcagagca 
caaacgccga 
taggcatcac 
agctttacaa 
agcagaatct 
aagagaccct 
aagatgatga 
tagatgacac 
aactggtcac 
ccacagtaga 
ctaagaagtt 
ctgactagcg 
tcattcatcc 
ctttttctgc 
acacaattct 
caccaattat 



cagcatcgtc 
ccaaggaaaa 
ctgtgccata 
caaataccca 
cctagcccca 
tccaagtaag 
tgaccatgtg 
tgatgattct 
tgattttccc 
cacatatgat 
tcgcagacct 
ctcaagtcta 
attcagcaag 
ttggcggttt 
actttttgaa 
tt 



gggaccagac 
ctcactacca 
ccagttaaac 
gatgctgtgg 
cagaatgctg 
tccaacgaaa 
gacagccagg 
caccagtctg 
acggacctgc 
ggccgaggtg 
gacatccagg 
ggaaaccaca 
aattcattca 
ttgtaagtct 
atcctggatc 



tcgtctcagg 
tgagaattgc 
aggctgattc 
ccacatggct 
tgtcctctga 
gccatgacca 
actccattga 
atgagtctca 
cagcaaccga 
atagtgtggt 
taaatccttt 
gtttgcatat 
tattctactt 
acataatttc 
agttattttc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

932 



<210> 6 

<211> 1314 

<212> DNA 

<213> Homo sapiens 



<400> 6 

agcgtcggac 

atatattttt 

tctagctctg 

actgccgtgg 

tggcgtgcga 

tggtggccgc 

tgaatgtgtg 

cgctctggac 

ccgctgtgga 

tatcgtctgc 

cgtgtgccct 

cagtttcgaa 

actgcgaaga 

gaggcccttc 

tatttgaacc 

tcgggggcag 

ccgctgccgc 

tgactgcaag 



taccgttggt 
ccgccttttc 
tgggaaggtt 
agccgcatcc 
caacgcggcg 
tcagcgccag 
agtcagaccc 
gcttt ccgca 
tgcagagctt 
aaacacccct 
cccctccccc 
ccctcacacg 
aagccggggc 
ccagcagcag 
ccctcccccc 
agtcgttgga 
cttccccatc 
ccccaccctc 



ttccgcaact 
tggaaggatt 
ttgggctctc 
actgtggatt 
cagaagatgc 
gatcgcctca 
ccttcccggg 
cgcttgtctt 
ctctgccgtt 
cccgccgtgg 
caccgtcacc 
gacgcctgga 
aacaaccagt 
aatctgttga 
cagcacaacc 
gactgaagag 
acggagggtc 
cttgagactg 



tcctggatta 
tcgctgcttc 
tggctcggat 
ataattgcaa 
agacggtgac 
cagtgggggt 
ctgggcgcgg 
gcatggagct 
ttgtggatcg 
gcctgtctcc 
agcttgcaga 
agagccacgg 
gggtccccta 
gttgctgcca 
cccccaaaac 
gaagaggagg 
cagactgtcc 
gagctggcgt 



tcctcgccaa 
ccgaagtctt 
tttgcaattt 
catgacgctg 
cgccgcggtg 
gtacgagtcg 
gtgggacggg 
gggacttccc 
ggggctgccg 
ccctacccca 
ggcaatcccc 
cttggtggag 
catctctctt 
caaacaaaaa 
aacccaaccc 
aggagaaggg 
actcgggggt 
ctgcatacga 



ggactttgca 
ggacgagcgc 
ctccctgggg 
gaagagctcg 
gaggagcttt 
gccaagttga 
acctcccctc 
caagtgcccc 
tatcctgatg 
tactttgaac 
tgcacccttg 
gtggccagct 
caggaacgct 
atacaataaa 
acgaggacca 
gagtgagcgg 
ggagtgagac 
gagacttggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 



4 



tgaacttggt tggtccttgt ctgcaccctc gacaagacca cactttggga cttgggagct 1140 
ggggctgaag ttgctctgta cccatgaact cccagtttgc gaattataga gacaatctat 1200 
tttgttactt gcacttgtta ttcgaaccac tgagagcgag atgggaagca tagatatcta 1260 
tatttttatt tctactatga gggccttgta ataaatttct aaagcctcaa aaaa 1314 



<210> 7 

<211> 2130 

<212> DNA 

<213> Homo sapiens 

<400> 7 

cgctgtgtct gcccggtctc tcctggggtc tccggttctg ccgcctcctg tccctccttc 60 

cggctgcccc ggccagaggg gctcgaaggc gcgttggggc ccggggccac gctgggagac 120 

acgccttcgg ccgcgcctaa ttcgagccag gcgcgggacg cgtcccccgg ccgggcggag 180 

cgggagggga cgcccgccgc gctcggcctc cccgggcccc gacccctcct ttgtaatttg 240 

aataaaacgc ctcccccgcc cgcgcgccgc cttaacccgc cgcctccgct ctccccggct 300 

gcaggcggcg tgcaggacca gcggcggccg tgcaggcgga ggacttcggc gcggctcctc 360 

ctgggtgtga ccccgggcgc gcccgccgcg cgacgatgag ggcgcggccg caggtctgcg 420 

aggcgctgct cttcgccctg gcgctccaga ccggcgtgtg ctatggcatc aagtggctgg 480 

cgctgtccaa gacaccatcg gccctggcac tgaaccagac gcaacactgc aagcagctgg 540 

agggtctggt gtctgcacag gtgcagctgt gccgcagcaa cctggagctc atgcacacgg 600 

tggtgcacgc cgcccgcgag gtcatgaagg cctgtcgccg ggcctttgcc gacatgcgct 660 

ggaactgctc ctccattgag ctcgccccca actatttgct tgacctggag agagggaccc 720 

gggagtcggc cttcgtgtat gcgctgtcgg ccgccgccat cagccacgcc atcgcccggg 780 

cctgcacctc cggcgacctg cccggctgct cctgcggccc cgtcccaggt gagccacccg 840 

ggcccgggaa ccgctgggga agatgtgcgg acaacctcag ctacgggctc ctcatggggg 900 

ccaagttttc cgatgctcct atgaaggtga aaaaaacagg atcccaagcc aataaactga 960 

tgcgtctaca caacagtgaa gtggggagac aggctctgcg cgcctctctg gaaatgaagt 1020 

gtaagtgcca tggggtgtct ggctcctgct ccatccgcac ctgctggaag gggctgcagg 1080 

agctgcagga tgtggctgct gacctcaaga cccgatacct gtcggccacc aaggtagtgc 1140 

accgacccat gggcacccgc aagcacctgg tgcccaagga cctggatatc cggcctgtga 1200 

aggactcgga actcgtctat ctgcagagct cacctgactt ctgcatgaag aatgagaagg 1260 

tgggctccca cgggacacaa gacaggcagt gcaacaagac atccaacgga agcgacagct 1320 

gcgaccttat gtgctgctac gtcacctgcc gcaggtgtga gcgtaccgtg gagcgctatg 1380 

tctgcaagtg aggccctgcc ctccgcccca cgcaggagcg aggactctgc tcaaggaccc 1440 

tcagcaactg gggccagggg cctggagaca ctccatggag ctctgcttgt gaattccaga 1500 

tgccaggcat gggaggcggc ttgtgctttg ccttcacttg gaagccacca ggaacagaag 1560 

gtctggccac cctggaagga gggcaggaca tcaaaggaaa ccgacaagat taaaaataac 1620 

ttggcagcct gaggctctgg agtgcccaca ggctggtgta aggagcgggg cttgggatcg 1680 

gtgagactga tacagacttg acctttcagg gccacagaga ccagcctccg ggaaggggtc 1740 

tgcccgcctt cttcagaatg ttctgcggga ccccctggcc caccctgggg tctgagcctg 1800 

ctgggcccac cacatggaat cactagcttg ggttgtaaat gttttctttt gttttttgct 18 60 

ttttcttcct ttgggatgtg gaagctacag aaatatttat aaaacatagc tttttctttg 1920 

gggtggcact tctcaattcc tctttatata ttttatatat ataaatatat atgtatatat 1980 

ataatgatct ctattttaaa actagctttt taagcagctg tatgaaataa atgctgagtg 2040 

agccccagcc cgcccctgca gttcccggcc tcgtcaagtg aactcggcag accctggggc 2100 

tggcagaggg agctctccag tttccgggca 2130 



<210> 8 

<211> 1847 

<212> DNA 

<213> Homo sapiens 

<400> 8 

cgctgtgtct gcccggtctc tcctggggtc tccggttctg ccgcctcctg tccctccttc 60 
cggctgcccc ggccagaggg gctcgaaggc gcgttggggc ccggggccac gctgggagac 120 
acgccttcgg ccgcgcctaa ttcgagccag gcgcgggacg cgtcccccgg ccgggcggag 180 
cgggagggga cgcccgccgc gctcggcctc cccgggcccc gacccctcct ttgtaatttg 240 
aataaaacgc ctcccccgcc cgcgcgccgc cttaacccgc cgcctccgct ctccccggct 300 



5 



gcaggcggcg 
ctgggtgtga 
aggcgctgct 
cgctgtccaa 
agggtctggt 
tggtgcacgc 
ggaactgctc 
gggagtcggc 
ggtagtgcac 
gcctgtgaag 
tgagaaggtg 
cgacagctgc 
cgagcggtgc 
taccgtggag 
actctgctca 
tgcttgtgaa 
gccaccagga 
acaagattaa 
agcggggctt 
gcctccggga 
cctggggtct 
ttcttttgtt 
acatagcttt 
aatatatatg 
gaaataaatg 
tcggcagacc 



tgcaggacca 
ccccgggcgc 
cttcgccctg 
gacaccatcg 
gtctgcacag 
cgcccgcgag 
ctccattgag 
cttcgtgtat 
cgacccatgg 
gactcggaac 
ggctcccacg 
gaccttatgt 
cactgtaagt 
cgctatgtct 
aggaccctca 
ttccagatgc 
acagaaggtc 
aaataacttg 
gggatcggtg 
aggggtctgc 
gagcctgctg 
ttttgctttt 
ttctttgggg 
tatatatata 
ctgagtgagc 
ctggggctgg 



gcggcggccg 
gcccgccgcg 
gcgctccaga 
gccctggcac 
gtgcagctgt 
gtcatgaagg 
ctcgccccca 
gcgctgctga 
gcacccgcaa 
tcgtctatct 
ggacacaaga 
gctgcgggcg 
accactggtg 
gcaagtgagg 
gcaactgggg 
caggcatggg 
tggccaccct 
gcagcctgag 
agactgatac 
ccgccttctt 
ggcccaccac 
tcttcctttg 
tggcacttct 
atgatctcta 
cGcagcccgc 
cagagggagc 



tgcaggcgga 
cgacgatgag 
ccggcgtgtg 
tgaaccagac 
gccgcagcaa 
cctgtcgccg 
actatttgct 
cctcaagacc 
gcacctggtg 
gcagagctca 
caggcagtgc 
tggctacaac 
ctgctacgtc 
ccctgccctc 
ccaggggcct 
aggcggcttg 
ggaaggaggg 
gctctggagt 
agacttgacc 
cagaatgttc 
atggaafccac 
ggatgtggaa 
caattcctct 
ttttaaaact 
ccctgcagtt 
tctccagttt 



ggacttcggc 
ggcgcggccg 
ctatggcatc 
gcaacactgc 
cctggagctc 
ggcctttgcc 
tgacctggag 
cgatacctgt 
cccaaggacc 
cctgacttct 
aacaagacat 
ccctacacag 
acctgccgca 
cgccccacgc 
ggagacactc 
tgctttgcct 
caggacatca 
gcccacaggc 
tttcagggcc 
tgcgggaccc 
tagcttgggt 
gctacagaaa 
ttatatattt 
agctttttaa 
cccggcctcg 
ccgggca 



gcggctcctc 
caggtctgcg 
aagtggctgg 
aagcagctgg 
atgcacacgg 
gacatgcgct 
agagggaccc 
cggccaccaa 
tggatatccg 
gcatgaagaa 
ccaacggaag 
accgcgtggt 
ggtgtgagcg 
aggagcgagg 
catggagctc 
tcacttggaa 
aaggaaaccg 
tggtgtaagg 
acagagacca 
cctggcccac 
tgtaaatgtt 
tatttataaa 
tatatatata 
gcagctgtat 
tcaagtgaac 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
.1500 
1560 
1620 
1680 
1740 
1800 
1847 



<210> 9 

<211> 2369 

<212> DNA 

<213> Homo sapiens 



<400> 9 

gaattccgcc 

agcagcggcg 

gtccaacttt 

tgctgtgctc 

gcctaccatg 

catgcagatc 

ctactcgctg 

gaaacctcaa 

ggacatgaat 

caagcaggca 

gcccacattt 

gtgccgacaa 

cacaggatca 

catgccacac 

gaccctgctg 

gcatcataga 

tgaagcgctg 

gatcttcaga 

gccgccatgt 

gttggatgag 

atctctgcag 

aagttttggc 

ggccttaaag 

gagagttctt 

gaccaaggaa 

catccaaagc 

tcttggtctg 



agccccgcca 
ccgtgctcgc 
gactgcgggt 
gtcaaagagt 
tacccaccct 
attgtgaaag 
gctgagaggt 
ggccgaatgc 
gaatttgaga 
aaggtccacc 
tgctctgtct 
tgcaatgcag 
gctatcaata 
agatttaaag 
tggggactgg 
tgccagacaa 
gccatgattg 
gaaggtccgg 
ttaccgacac 
gtggataaaa 
attaaactaa 
aaggtcttcc 
aaagatgtgg 
tccttggcct 
aacctctttt 
tgccacaagt 
cagttccttc 



gtccccgcgc 
tccagggcgc 
cctgccagtc 
atgtcgaatc 
gggacagcac 
gcaaaaacgt 
gcaggaagaa 
taatgaatgc 
cggaaggctt 
acgtcaagtg 
gccacgagtt 
caattcacaa 
gccgagaaac 
tctacaatta 
cacggcaagg 
aggtggccaa 
agagcactca 
ttgaaattgg 
cgggaaaaag 
tgtgccatct 
aaattgagga 
tggcagaatt 
tcttgatgga 
gggagcatcc 
ttgtgatgga 
tcgacctttc 
attccaaagg 



agtccccgcg 
aaccatgtcg 
ttgtcagggc 
agagaacggg 
ttttgatgcc 
ggacctcatc 
caacgggaag 
aagatacttt 
ctttgctttg 
ccacgagttc 
tgtctggggc 
gaagtgtatt 
catgttccac 
caagagcccg 
actcaagtgt 
cctttgtggc 
acaggctcgc 
tctcccatgc 
agagcctcag 
tccagaacct 
ttttatcttg 
caagaaaacc 
cgatgatgtt 
gtttctgacg 
gtacctcaac 
cagagcgacg 
aatagtctac 



cagtcccagc 
ccatttcttc 
gaggctgtta 
cagatgtata 
catat caaca 
tctgaaacca 
acagaaatat 
ctggaaatga 
catcagcgcc 
actgccacct 
ctgaacaaac 
gataaagtta 
aaggagagat 
accttctgtg 
gatgcatgtg 
ataaaccaga 
tgcttaagag 
tccatcaaaa 
ggcatttcct 
gaactgaaca 
cacaaaatgt 
aatcaatttt 
gagtgcacga 
cacatgtttt 
ggaggggact 
ttttatgctg 
agggacctga 



gccaccgggc 
ggattggctt 
acccttactg 
tccagaaaaa 
agggaagagt 
ccgtggagct 
ggttagagct 
gtgacacaaa 
ggggtgccat 
tcttcccaca 
agggctacca 
tagcaaagtg 
tcaaaattga 
aacactgtgg 
gcatgaatgt 
agctaatggc 
atactgaaca 
atgaagcaag 
gggagtctcc 
aagaaagacc 
tggggaaagg 
tcgcaataaa 
tggtagagaa 
gtacattcca 
taatgtacca 
ctgaaatcat 
agctagataa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 
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catcctgtta gacaaagatg gacatatcaa gatcgcggat tttggaatgt gcaaggagaa 1680 
catgttagga gatgccaaga cgaatacctt ctgtgggaca cctgactaca tcgccccaga 1740 
gatcttgctg ggtcagaaat acaaccactc tgtggactgg tggtccttcg gggttctcct 1800 
ttatgaaatg ctgattggtc agtcgccttt ccacgggcag gatgaggagg agctcttcca 1860 
ctccatccgc atggacaatc ccttttaccc acggtggctg gagaaggaag caaaggacct 1920 
tctggtgaag gtaagaagct gaaggtaaga agcgaagcca agagcgtctt cataagacga 1980 
gcattaggtc ttctggtcag ttttctgttc ctcttagttt ccaacttgca tgtggcaaac 2040 
aatgattatt attgaactgg gtttaaatgg gatgtgcacc gtctgtgttt-taatagaggc 2100 
accaatatta tgagcattaa atgtcaaaat gagtgtaaga gaaaccctca tgtgcatcag 2160 
ttataacata acggccccag gaaccagttc catggacctt gaatacgctc acctggagat 2220 
gtagttggtt cattaaacaa gcacagtgtg tggcttaaaa atcaatcttc tagctacttg 2280 
ggaggttgag gcaggaggat tgaagttgga agtgtgaggt cagcctgggc aatgtatcga 2340 
gacccctgtc tccaaaacaa taaagggga 2369 



<210> 10 

<211> 1101 

<212> DNA 

<213> Homo sapiens 



<400> 10 

ctccttagtc 

tgcccagtcc 

ccctcttcca 

ccctcgcgcg 

tgcgcatgag 

ggattaataa 

gggacatcaa 

aagcagggga 

actccctgcc 

tgcgagtgta 

agtccagccg 

ataccttctc 

ttccaggcta 

gtgctgtcag 

ccagtgatct 

atgaggatga 

ggcagccaac 

cctctgtcta 

agaaggcccc 



gaggcaagac 
acgccggccg 
ctcggagtcg 
cgaccagccg 
accctggcta 
agaggagatg 
caaggatgcc 
aaaggagcca 
agatatcgag 
ccggatgctt 
agatgctaag 
tgatggactc 
catgcaggac 
cagcactctc 
gtacaacttc 
ggaagggaaa 
aaacgtggat 
tggagacttt 
tggatcctta 



gtgcgcccga 
tgcccggcgg 
cgctccgcgc 
aatcgctcct 
gagatgcaga 
atcttccaga 
tgtttgttcc 
gatcccaaga 
gaggtgaaag 
ccacctctca 
agcaaggcca 
agcagctcca 
ttggaggtgg 
cccgactggc 
caggtgtcac 
ttacctgagg 
gggaaggggt 
agctgtaagg 
t 



gccccgccga 
ccttaagaac 
gccctcactg 
gcagcagagc 
ttaattccaa 
tcccatggaa 
ggagctgggc 
cgtggaaggc 
accagagcag 
ccaagaacca 
agaggaagtc 
ctctgcctga 
agcaggccct 
acatcccagt 
ccatgccctc 
acatcatgaa 
acctactcaa 
aggagccaga 



accgaggcca 
caggcaacct 
cagcccctgc 
caacatgccc 
ccaaatcccg 
gcatgctgcc 
cattcacaca 
caactttcgc 
gaacaagggc 
gagaaaagaa 
atgtggggat 
tgaccacagc 
gactccagca 
ggaagttgtg 
cacctctgaa 
gctcttggag 
tgaacctgga 
aattgacagc 



cccggagccg 

ctgccttctt 

gtcgccggga 

atcactcgga 

gggctcatct 

aagcatggct 

ggccgataca 

tgtgccatga 

agctcagctg 

agaaagtcga 

tccagccctg 

agctacacag 

ctgtcgccat 

ccggacagca 

gctacaacag 

cagtcggagt 

gtccagccca. 

ccagggggta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1101 



<210> 11 

<211> 4628 

<212> DNA 

<213> Homo sapiens 



<400> 11 

cctcttgcgg 

cgccggtgca 

ccggwcggga 

gcgcgccgct 

cgccccgcag 

ccgcagggcg 

gaggggggca 

ctccggggca 

gcgccgccgg 

gagcccgggg 

cggcggactc 

tcgagctcac 

gaactgggat 



ccacaggcgc 
gccgcagcgc 
ccgagacccc 
gcttgaaaag 
gcgcacggta 
atggagcccg 
caaggtctgg 
ccagctccgg 
gagtcgagcg 
gcggcgggcc 
tcccgaggcg 
tgtggagtat 
gtggagctgg 



ggcgtcctcg 
gcggaggaac 
tcgtagcgca 
ccgcggaacc 
cccgtgctgc 
gtctgcaagg 
agaccccggg 
ctccattgtt 
ccggccgcgg 
ggagccgggg 
gaacctccac 
ccatggagat 
aagtgcctcc 



gcggcgggcg 
ccgggtgtgc 
ttgcggcgac 
caaggacttt 
agtcgggcac 
aaagtgaggc 
tggcggacgg 
cccgcccggg 
agctcttgcg 
acgcgggcac 
gccgagcgag 
gtggagcctt 
tcttctgggc 



gcagctagcg 
cgggagctgg 
ctcgccttcc 
tctccggtcc 
gccgcgggcg 
gccgccgctg 
gagccctccc 
ctggaggcgc 
accccgccag 
acgcccgctc 
ggtcagtttg 
gtcaccaacc 
tgtgctggtc 



ggagccggga 
gcggccacgt 
ccggccgcga 
gagctcgggg 
ccgggggcct 
cgttctggag 
cccgccccgc 
cgagcaccga 
gacccgaaca 
gcacaagcca 
aaaaggagga 
tctaactgca 
acagccacac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 
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tctgcaccgc 
tggaagtgga 
gggacgatgt 
gcacccgcat 
tctatgcttg 
tttcagatgc 
agaaagaaac 
cagaaaagat 
gcccttccag 
aacctgacca 
actctgtggt 
gcatcaacca 
aagcagggtt 
aggtgtacag 
gcaagattgg 
ccaccgacaa 
agtatacgtg 
ttctggaagc 
tcatctattg 
agatgaagag 
ccaagagcat 
actctggggt 
caggggtctc 
tggtcttagg 
tcgggctgga 
cggacgcaac 
tcgggaagca 
atgtcatcgt 
ccccagggct 
aggacctggt 
agtgcataca 
agatagcaga 
ccaacggccg 
cccaccagag 
acttttcaag 
gctgtacatg 
gcagctggtg 
cctgtccatg 
ctcctcaggg 
ccgacaccca 
ccctccccag 
cctgtccgtc 
tgtgtggcct 
ggtgagaggt 
ggaccaacac 
aacaggcatg 
tcacccataa 
gggaggtcag 
gtggcttatt 
ggtgaggcga 
gggctctgta 
tgtcttttta 
tgggaggcat 
aaaaggtcat 
ctattttcag 
atgaaaaggg 
ctgtcccgag 
cctgcaccca 
aagaagatgt 
agtcaaggcc 
tgggctgggg 



taggccgtcc 
gtccttcctg 
gcagagcatc 
cacaggggag 
cgtaaccagc 
tctcccctcc 
agataacacc 
ggaaaagaaa 
tgggacccca 
cagaattgga 
gccctctgac 
cacataccag 
gcccgccaac 
tgacccgcag 
cccagacaac 
agagatggag 
cttggcgggt 
cctggaagag 
cacaggggcc 
tggtaccaag 
ccctctgcgc 
tcttctggtt 
tgagtatgag 
caaacccctg 
caaggacaaa 
agagaaagac 
taagaatatc 
ggagtatgcc 
ggaatactgc 
gtcctgcgcc 
ccgagacctg 
ctttggcctc 
actgcctgtg 
tgatgtgtgg 
ctgctgaagg 
atgatgcggg 
gaagacctgg 
cccctggacc 
gaggattccg 
gcccagcttg 
actccaccgt 
cctgtcccct 
gccttcaccc 
gcaaagaggc 
ccctccctgc 
caagtgagag 
gcccctcgca 
tgcttcgtgc 
aattccgata 
aggccaggtt 
tatagctatg 
aaagggtcgt 
cagttgctat 
atattttttg 
tttaggtccc 
tgggaatgtc 
ggcaccgcat 
gacctcacga 
ctgcttcgag 
cagcagccat 
agggggcacc 



ccgaccttgc 
gtccaccccg 
aactggctgc 
gaggtggagg 
agcccctcgg 
tcggaggatg 
aaaccaaacc 
ttgcatgcag 
aaccccacac 
ggctacaagg 
aagggcaact 
ctggatgtcg 
aaaacagtgg 
ccgcacatcc 
ctgccttatg 
gtgcttcact 
aactctatcg 
aggccggcag 
ttcctcatct 
aagagtgact 
agacaggtaa 
cggccatcac 
cttcccgaag 
ggagagggct 
cccaaccgtg 
ttgtcagacc 
atcaacctgc 
tccaagggca 
tacaacccca 
taccaggtgg 
gcagccagga 
gcacgggaca 
aagtggatgg 
tctttcgggg 
agggtcaccg 
actgctggca 
accgcatcgt 
agtactcccc 
tcttctctca 
ccaatggcgg 
cagctgtaac 
ttcctgctgg 
cactcagctc 
agatctttgc 
caccaggcac 
cttcctgagc 
ctctggtggc 
ctcgattgaa 
ctagtttgct 
gggggcagtg 
aagaaaacac 
taccagagat 
atattaaaaa 
ctacttttgc 
tcaataaaaa 
cactggaaag 
gctcccggcg 
cgcacctcct 
ggcaggaacc 
cgaccatgga 
tacctaggga 



ctgaacaagc 
gtgacctgct 
gggacggggt 
tgcaggactc 
gcagtgacac 
atgatgatga 
gtatgcccgt 
tgccggctgc 
tgcgctggtt 
tccgttatgc 
acacctgcat 
tggagcggtc 
ccctgggtag 
agtggctaaa 
tccagatctt 
taagaaatgt 
gactctccca 
tgatgacctc 
cctgcatggt 
tccacagcca 
cagtgtctgc 
ggctctcctc 
accctcgctg 
gctttgggca 
tgaccaaagt 
tgatctcaga 

tgggggcctg 

acctgcggga 
gccacaaccc 
cccgaggcat 
atgtcctggt 
ttcaccacat 
cacccgaggc 
tgccccatac 
catggacaag 
tgcagtgccc 
ggccttgacc 
cagctttccc 
tgagccgctg 
actcaaacgc 
cctcacccac 
caggagccgg 
acctctccct 
tgccagccac 
tgcctggagg 
tttctcctgt 
aggtgccttg 
ggtgacctct 
ttgctgacca 
ttgtggccct 
aaagtgtata 
ttacccatcg 
caaaaaagaa 
tgttttattt 
ttgctgctgc 
aagggacacc 
caggttcctt 
gcctctccgc 
ccatccatgc 
tggtttcctc 
ttagccacgg 



ccagccctgg 
gcagcttcgc 
gcagctggcg 
cgtgcccgca 
cacctacttc 
tgatgactcc 
agctccatat 
caagacagtg 
gaaaaatggc 
cacctggagc 
tgtggagaat 
ccctcaccgg 
caacgtggag 
gcacatcgag 
gaagactgct 
ctcctttgag 
tcactctgca 
gcccctgtac 
ggggtcggtc 
gatggctgtg 
tgactccagt 
cagtgggact 
ggagctgcct 
ggtggtgttg 
ggctgtgaag 
aatggagatg 
cacgcaggat 
gtacctgcag 
agaggagcag 
ggagtatctg 
gacagaggac 
cgactactat 
attatttgac 
cccggtgtgc 
cccagtaact 
tcacagagac 
tccaaccagg 
gacacccgga 
cccgaggagc 
cgctgactgc 
agcccctgct 
ctgcctacca 
ccacctcctc 
ttcatcccct 
gcagggagtg 
cggtttggtc 
tcctcagggc 
gccccagata 
aatgcctggt 
ggggcccagc 
aatctgagta 
ggtaagatgc 
aaaaaaggaa 
ttttaaatta 
ttcatttatc 
cacgggccct 
gtaacctctt 
tgcttttgga 
agtagaggcg 
caagggaaac 
ggtagagcta 



ggagcccctg 
tgtcggctgc 
gaaagcaacc 
gactccggcc 
tccgtcaatg 
tcttcagagg 
tggacatccc 
aagttcaaat 
aaagaattca 
atcataatgg 
gagtacggca 
cccatcctgc 
ttcatgtgta 
gtgaatggga 
ggagttaata 
gacgcagggg 
tggttgaccg 
ctggagatca 
atcgtctaca 
cacaagctgg 
gcatccatga 
cccatgctag 
cgggacagac 
gcagaggcta 
atgttgaagt 
atgaagatga 
ggtcccttgt 
gcccggaggc 
ctctcctcca 
gcctccaaga 
aatgtgatga 
aaaaagacaa 
cggatctaca 
ctgtggagga 
gcaccaacga 
ccaccttcaa 
agtacctgga 
gctctacgtg 
cctgcctgcc 
cacccacacg 
gggcccacca 
ggggccttcc 
tccacctgct 
cccagatgtt 
ggagccaatg 
tgttttgcct 
tacagcagta 
ggtggtgcca 
accagaggat 
cccaaactgg 
tatatttaca 
tcctggtggc 
aatgttttta 
tgttctaaac 
tatgggctgt 
ggggctaggt 
cttcctaggt 
aagtcagaaa 
ctgggcagag 
cggtggggtt 
cagtgattaa 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
.2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
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gaggaaagca agggcgtgag ccaccacgcc tggcattaat ttgtcctttt ttaagatatg 4500 

gggtctcact atgttgccca ggctggtctt gaactcctgg gctcaagtga tcctcctgcc 4560 

ttagcctccc aaagtgctgg gattacaggc gtgagccact gtgcccagcc tgatgtgttc 4620 

ttaattta 4628 



<210> 12 

<211> 1418 

<212> DNA 

<213> Homo sapiens 



<400> 12 

ccacgtgggg 

atcgggccgg 

ctgaacggca 

tctcgcgagt 

ccaagggcgt 

ggggatgagc 

gacaaagagg 

tcacccagtg 

gcctggatta 

cttcaagaga 

ctctctcatc 

ctttcccgcc 

cgaggtctgg 

aagggcctgg 

gaaattgacc 

cgccaaggct 

taccccaaca 

aatgttggtt 

gacagcaaga 

gcactgtgac 

ttccttcttt 

aatcctccaa 

gttttttgtt 

cacatgaaaa 



gctcagcgtg 
tgcagatacg 
agcttgagtc 
agggcaacgc 
tcgtaacggg 
tcaggacaga 
cagcaggaga 
gcaaacctgc 
agaagaaagg 
ccaaatgttc 
aatactggtc 
agtgcccact 
tacgactgga 
cttcccgaaa 
ttcgcaaccc 
tcggggaatt 
caccctatgc 
ggcgccttga 
tccgttccaa 
accaccccta 
aaacactctt 
ccaggctcct 
ttttaaaaaa 
taaagagcca 



cacccttctt 
gggttgctct 
aggaccctta 
ggtaaaaata 
aatgccgaag 
gccagaggcc 
gggcccagcc 
cacactcaag 
attagattgg 
agagaacaaa 
agctccttcg 
caaagtttct 
gtaccggcag 
gccccttgtg 
caaggggaac 
actgcaggct 
ctacaccttt 
ttactttttg 
ggccctcggc 
aatcactttg 
cagagaaatc 
gtgatagagt 
aaattgaaca 
tagtttcagc 



tgtgctcggg 
tttgctcata 
attaagatcc 
ttgcttcggt 
cgtgggaaaa 
aagaagagta 
ctgtatgagg 
atctgctctt 
gtaaaggaag 
ctaccagctg 
gacaaggaag 
tacggcatag 
cgctgggatg 
ctgtgtggag 
aaaaagaatg 
gtgccactgg 
tggacttata 
ttgtcccact 
agtgatcact 
agcctgggaa 
tgcattctat 
tcttttaagc 
aagactacta 
gtcgtacc 



ttaggaggag 
agaggggctt 
tcaattggct 
gggtgacgcg 
agggagcggt 
agacggccgc 
accccccaga 
ggaatgtgga 
aagccccaga 
aacttcagga 
ggtacagtgg 
catatgtacc 
aagcctttcg 
acctcaatgt 
ctggcttcac 
ctgacagctt 
tgatgaatgc 
ctctgttacc 
gtcctatcac 
ataagccccc 
ttctcatgta 
ccaagatttt 
atgactttgt 



ctaggctgcc 
cgctggcagt 
ggagggcaga 
gtacagctgc 
ggcggaagac 
aaagaaaaat 
tcagaaaacc 
tgggcttcga 
tatactgtgc 
gctgcctgga 
cgtgggcctg 
taatgcaggc 
caagttcctg 
ggcacatgaa 
gccacaagag 
taggcacctc 
tcgatccaag 
tgcattgtgt 
cctataccta 
tcaactacca 
taaaactagg 
ttatttgagg 
ttgaattatc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1418 



<210> 13 

<211> 1558 

<212> DNA 

<213> Homo sapiens 



<400> 13 

ccacgtgggg 

atcgggccgg 

ctgaacggca 

tctcgcgagt 

ccaagggcgt 

ggggatgagc 

ggggtgtttg 

caaagaaaaa 

atcagaaaac 

atgggcttcg 

atatactgtg 

agctgcctgg 

gcgtgggcct 

agcatgatca 

catatgtacc 

aagcctttcg 

acctcaatgt 



gctcagcgtg 
tgcagatacg 
agcttgagtc 
agggcaacgc 
tcgtaacggg 
tcaggacagg 
tcattccctt 
tgacaaagag 
ctcacccagt 
agcctggatt 
ccttcaagag 
actctctcat 
gctttcccgc 
ggaaggccgg 
taatgcaggc 
caagttcctg 
ggcacatgaa 



cacccttctt 
gggttgctct 
aggaccctta 
ggtaaaaata 
aatgccgaag 
taagggaatg 
gatgtacgag 
gcagcaggag 
ggcaaacctg 
aagaagaaag 
accaaatgtt 
caatactggt 
cagtgcccac 
gtgattgtgg 
cgaggtctgg 
aagggcctgg 
gaaattgacc 



tgtgctcggg 
tttgctcata 
attaagatcc 
ttgcttcggt 
cgtgggaaaa 
aaatcagccc 
agccagaggc 
agggcccagc 
ccacactcaa 
gattagattg 
cagagaacaa 
cagctccttc 
tcaaagtttc 
ctgaatttga 
tacgactgga 
cttcccgaaa 
ttcgcaaccc 



ttaggaggag 
agaggggctt 
tcaattggct 
gggtgacgcg 
agggagcggt 
ttcttcctag 
caagaagagt 
cctgtatgag 
gatctgctct 
ggtaaaggaa 
actaccagct 
ggacaaggaa 
ttacggcata 
ctcgtttgtg 
gtaccggcag 
gccccttgtg 
caaggggaac 



ctaggctgcc 
cgctggcagt 
ggagggcaga 
gtacagctgc 
ggcggaagac 
aactgcggcg 
aagacggccg 
gaccccccag 
tggaatgtgg 
gaagccccag 
gaacttcagg 
gggtacagtg 
ggcgatgagg 
ctggtaacag 
cgctgggatg 
ctgtgtggag 
aaaaagaatg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 
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ctggcttcac 
ctgacagctt 
tgatgaatgc 
ctctgttacc 
gtcctatcac 
ataagccccc 
ttctcatgta 
ccaagatttt 
atgactttgt 



gccacaagag 
taggcacctc 
tcgatccaag 
tgcattgtgt 
cctataccta 
tcaactacca 
taaaactagg 
ttatttgagg 
ttgaattatc 



cgccaaggct 
taccccaaca 
aatgttggtt 
gacagcaaga 
gcactgtgac 
ttccttcttt 
aatcctccaa 
gttttttgtt 
cacatgaaaa 



tcggggaatt 
caccctatgc 
ggcgccttga 
tccgttccaa 
accaccccta 
aaacactctt 
ccaggctcct 
ttttaaaa^a 
taaagagcca 



actgcaggct 
ctacaccttt 
ttactttttg 
ggccctcggc 
aatcactttg 
cagagaaatc 
gtgatagagt 
aaattgaaca 
tagtttcagc 



gtgccactgg 
tggacttata 
ttgtcccact 
agtgatcact 
agcctgggaa 
tgcattctat 
tcttttaagc 
aagactacta 
gtcgtacc 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1558 



<210> 14 

<211> 1911 

<212> DNA 

<213> Homo sapiens 



<400> 14 

tgccgccgtc 

gcacccgcag 

ggagtgggcc 

ccacgacagc 

gcaggagatc 

gctcaccgag 

gaccatggaa 

cctgcagcag 

acttctggga 

ccttgcctgt 

cccgcacctc 

agcctctatc 

caatgctcag 

tcctgacctg 

gggctattct 

gggccagctg 

tgacacagag 

gcgctctaac 

atcagagggg 

aatgtacctg 

aggcaaactc 

gagttatttc 

cttaacgtgt 

ccagcgtctg 

aaaaaaaaag 

ttatatccac 

attttgttgc 

aacttctttt 

aaggaccaca 

atgtaacagc 

aggtttgtgt 

acttttatga 



ccgcccgcca 
cagcgcccgc 
atggagggcc 
ggcctggact 
cgcctcgagc 
gacggggact 
gtgatccgcc 
actccactcc 
gctggctgtg 
gagcagggct 
cactccatcc 
catggctacc 
gagccctgta 
gtgtcactcc 
ccctaccagc 
acactagaaa 
tcagagttca 
taatgaggtg 
tatctacata 
ccccccccct 
cctcagcctg 
cagtagtggc 
cttttttccc 
acgttatgag 
ttttattttt 
actgcacact 
ttttgtgaac 
aaacctcacc 
ttttatattt 
caggagtgtt 
taccctcctg 
tgtatattta 



gcgccccagc 
agctcgtccg 
cccgcgacgg 
ccatgaaaga 
cgcaggaggt 
cgttcctgca 
aggtgaaggg 
acttggctgt 
atcctgagct 
gcctggccag 
tgaaggctac 
tgggcatcgt 
atggccggac 
tgttgaagtg 
tcacctgggg 
accttcagat 
cggagttcac 
ccattccctt 
atgagtctct 
ttttttaata 
taaagttcat 
ctccccatcc 
cttgttttca 
cgcaaagggg 
ctaaaaaaag 
gcctggccca 
tttttgtagg 
tttgtggggt 
attgtgcttc 
aagcgttcag 
taaatggtgt 
ttaaacagat 



gaggaagcag 
cgccatgttc 
gctgaagaag 
cgaggagtac 
gccgcgcggc 
cttggccatc 
agacctggcc 
gatcaccaac 
ccgagacttt 
cgtgggagtc 
caactacaat 
ggagcttttg 
tgcccttcac 
tggggctgat 
ccgcccaagc 
gctgccagag 
agaggacgag 
caccctccca 
caaatttctg 
gcttactctt 
tatatttggg 
cggtagcttg 
gctgccctat 
ctgaaagaac 
aaaaaagaag 
aaacgtctta 
ggacgagaaa 
ttttggagaa 
gagtgactga 
tgatgtgggg 
acataatgta 
ttttacaaat 



cgcgcagccc 
caggcggccg 
gagcggctac 
gagcagatgg 
tcggagccct 
atccatgaag 
ttcctcaact 
cagccagaaa 
cgaggaaata 
ctgactcagt 
ggccacacgt 
gtgtccttgg 
ctcgcagtgg 
gtcaacagag 
acccggatac 
agtgaggatg 
gtgagtctgt 
ggcccctaga 
tgcataacca 
tttttaaaga 
ctatggagaa 
gcagagctcc 
gatgactgtg 
atggacttgt 
aaaaaattta 
ttgtggtagg 
gatcattgaa 
ggttatcaaa 
ccccagtggt 
tgaaaagtta 
ttgttggtaa 
ggaanaaaga 



gcggcccagc 
agcgccccca 
tggacgaccg 
tcaaggagct 
ggaagcagca 
aaaaggcact 
tccagaacaa 
ttgctgaggc 
cccccctaca 
cctgcaccac 
gtctacactt 
gtgctgatgt 
acctgcaaaa 
ttacctacca 
agcagcagct 
aggagagcta 
gaactcttcg 
gctgctcctt 
gtatcccaag 
gaagcattaa 
tggagtccaa 
gctccggaag 
tgtttggagg 
atatttgtac 
aagggtgtac 
atcagccctc 
attctgagaa 
aatttcatgg 
atcctgtgac 
ctacctgtca 
ttattttggt 
n 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1911 



<210> 15 

<211> 1476 

<212> DNA 

<213> Homo sapiens 

<400> 15 

tgccgccgtc ccgcccgcca gcgccccagc gaggaagcag cgcgcagccc gcggcccagc 60 
gcacccgcag cagcgcccgc agctcgtccg cgccatgttc caggcggccg agcgccccca 120 
ggagtgggcc atggagggcc cccgcgacgg gctgaagaag gagcggctac tggacgaccg 180 
ccacgacagc ggcctggact ccatgaaaga cgaggagtac gagcagatgg tcaaggagct 240 
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gcaggagatc 
gctcaccgag 
gaccatggaa 
cctgcagcag 
acttctggga 
ccttgcctgt 
cccgcacctc 
ccggactgcc 
gaagtgtggg 
ctggggccgc 
tcagatgctg 
gttcacagag 
atgagcgcaa 
tttttctaaa 
acactgcctg 
tgaacttttt 
tcacctttgt 
tatttattgt 
gtgttaagcg 
tcctgtaaat 
atttattaaa 



cgcctcgagc 
gacggggact 
gtgatccgcc 
actccactcc 
gctggctgtg 
gagcagggct 
cactccatcc 
cttcacctcg 
gctgatgtca 
ccaagcaccc 
ccagagagtg 
gacgagctgc 
aggggctgaa 
aaaagaaaaa 
gcccaaaacg 
gtaggggacg 
ggggtttttg 
gcttcgagtg 
ttcagtgatg 
ggtgtacata 
cagattttta 



cgcaggaggt 
cgttcctgca 
aggtgaaggg 
acttggctgt 
atcctgagct 
gcctggccag 
tgaaggctac 
cagtggacct 
acagagttac 
ggatacagca 
aggatgagga 
cctatgatga 
agaacatgga 
agaagaaaaa 
tcttattgtg 
agaaagatca 
gagaaggtta 
actgacccca 
tggggtgaaa 
atgtattgtt 
caaatggaan 



gccgcgcggc 
cttggccatc 
agacctggcc 
gatcaccaac 
ccgagacttt 
cgtgggagtc 
caactacaat 
gcaaaatcct 
ctaccagggc 
gcagctgggc 
gagctatgac 
ctgtgtgttt 
cttgtatatt 
atttaaaggg 
gtaggatcag 
ttgaaattct 
tcaaaaattt 
gtggtatcct 
agttactacc 
ggtaattatt 
aaagan 



tcggagccct 
atccatgaag 
ttcctcaact 
cagccagaaa 
cgaggaaata 
ctgactcagt 
ggtcaggagc 
gacctggtgt 
tattctccct 
cagctgacac 
acagagtcag 
ggaggccagc 
tgtacaaaaa 
tgtacttata 
ccctcatttt 
gagaaaactt 
catggaagga 
gtgacatgta 
tgtcaaggtt 
ttggtacttt 



ggaagcagca 
aaaaggcact 
tccagaacaa 
ttgctgaggc 
cccccctaca 
cctgcaccac 
cctgtaatgg 
cactcctgtt 
accagctcac 
tagaaaacct 
agttcacgga 
gtctgacgtt 
aaaagtttta 
tccacactgc 
gttgcttttg 
cttttaaacc 
ccacatttta 
acagccagga 
tgtgttaccc 
tatgatgtat 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1476 



<210> 16 

<211> 2913 

<212> DNA 

<213> Homo sapiens 



<400> 16 

aagcggcagg 

ttttcgtgtc 

gaagttacct 

ctggaaggag 

taccaagtgt 

acccgagaag 

agtcttccgg 

gacaacgaca 

gctgatgaga 

atccgggatg 

gcctgcatcg 

aatctggccc 

cgaggctcct 

gatggtgaat 

agcggagaat 

tgtgccaagt 

gaccgaggct 

tctgctcccc 

agccctcaga 

ggagctggtg 

cagaatggct 

acctttgaaa 

gtttctgtca 

aaagaagtca 

gtaatcctgg 

atagttcgga 

gttttccacg 

gttacaacca 

ctggtctctg 

agccggagac 

aatcaaggtg 

cgagagtttg 



agcagcgttg 
tcttcgggat 
tattggattc 
ggtgggagga 
gcaatgtgat 
gggctcagag 
gcgtcatggg 
aagagcgttt 
gcttcaccca 
tagggccatt 
ccctggtatc 
agtttcctga 
gtgtcaacaa 
ggctggtacc 
gccaagcttg 
gcccacccca 
ttttcagagc 
tgaacttgat 
atacaggtgg 
accccagcaa 
tgaagaccac 
tctgggctgt 
ctgtgaccac 
caagatacag 
aatatgaagt 
cagctgccag 
tgcgagccag 
acacagtgcc 
tctcgggcag 
ggagtaaata 
taagaacata 
ccaaagaaat 



gcaccggcga 
ttgcgacgct 
cagatctgtt 
agtgagtatc 
ggaacccagc 
ggtgtatatt 
gacttgcaag 
catcagagag 
agtggacatt 
aagcaaaaag 
agtccgtgtg 
caccatcaca 
ctcagaagag 
cattggcaac 
caaaattgga 
cagctactct 
tgacaacgat 
ttcaaatgtc 
ccgccaggac 
gtgccgaccc 
caaagtctcc 
gaatggagtg 
caaccaagca 
tgtggcactg 
caagtattat 
gaacacagat 
gacagcagct 
ttcccggatc 
tgtggtgctg 
cagtaaagcc 
tgtggacccc 
tgacgcatcc 



accatggctg 
gtcacaggtt 
cagggagaac 
atggatgaaa 
cagaataact 
gagattaaat 
gagacgttta 
aaccagtttg 
ggtgacagaa 
gggttttacc 
ttct'ataaaa 
ggggctgata 
aaagatgtgc 
tgcctatgca 
tattacaagg 
gtctgggaag 
gctgcctcta 
aacgagacat 
atttcctata 
tgtggaagtg 
atcactgacc 
tccaaatata 
gcaccatcat 
gcttggctgg 
gagaaggatc 
atcaaaggcc 
ggctatggag 
attggagatg 
gtggtaattc 
aaacaagaag 
tttacgtacg 
tgcattaaga 



ggattttcta 
ccagggtata 
ttgggtggat 
aaaatacacc 
ggctacgaac 
tcaccttgag 
acctgtacta 
tcaaaattga 
tcatgaagct 
tggcttttca 
agtgtccact 
cgtcttccct 
caaaaatgta 
acgctgggca 
ctctctccac 
gagccacctc 
tgccctgcac 
ctgtgaactt 
atgtggtatg 
gggtccacta 
tcctagctca 
accctaaccc 
ccattgcttt 
aaccagatcg 
agaatgagcg 
tgaaccctct 
acttcagtga 
gggctaactc 
tcattgcagc 
cggatgaaga 
aagatcccaa 
ttgaaaaagt 



tttcgcccta 
ccccgcgaat 
agcaagccct 
aatccgaacc 
tgattggatc 
ggactgcaat 
ctatgaatca 
caccattgct 
gaacaccgag 
ggatgtgggg 
cacagtccgc 
ggtggaagtt 
ctgtggggca 
tgaggagcgg 
ggatgccacc 
gtgcacctgt 
ccgtccacca 
ggaatggagt 
caagaaatgt 
caccccacag 
taccaattac 
agaccaatca 
ggtccaggct 
gcccaatggg 
aagctatcgt 
cacttcctat 
gcccttggag 
cacagtcctt 
ttttgtcatc 
gaaacatttg 
ccaagcagtg 
tataggagtt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 
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ggtgaatttg gtgaggtatg cagtgggcgt ctcaaagtgc ctggcaagag agagatctgt 1980 
gtggctatca agactctgaa agctggttat acagacaaac agaggagaga cttcctgagt 2040 
gaggccagca tcatgggaca gtttgaccat ccgaacatca ttcacttgga aggcgtggtc 2100 
actaaatgta aaccagtaat gatcataaca gagtacatgg agaatggctc cttggatgca 2160 
ttcctcagga aaaatgatgg cagatttaca gtcattcagc tggtgggcat gcttcgtggc 2220 
attgggtctg ggatgaagta tttatctgat atgagctatg tgcatcgtga tctggccgca 2280 
cggaacatcc tggtgaacag caacttggtc tgcaaagtgt ctgattttgg catgtcccga 2340 
gtgcttgagg atgatccgga agcagcttac accaccaggg gtggcaagat tcctatccgg 2400 
tggactgcgc cagaagcaat tgcctatcgt aaattcacat cagcaagtga tgtatggagc 2460 
tatggaatcg ttatgtggga agtgatgtcg tacggggaga ggccctattg ggatatgtcc 252 0 
aatcaagata cctaacactg ccttgttgga tccaagctcc cctgaattct ctgctgtggt 2580 
atcagtgggc gattggctcc aggccattaa aatggaccgg tataaggata acttcacagc 264 0 
tgctggttat accacactag aggctgtggt gcacgtgaac caggaggacc tggcaagaat 2700 
tggtatcaca gccatcacgc accagaataa gattttgagc agtgtccagg caatgcgaac 2760 
ccaaatgcag cagatgcacg gcagaatggt tcccgtctga gccagtactg aataaactca 2820 
aaactcttga aattagttta cctcatccat gcactttaat tgaagaactg cacttttttt 2880 
acttcgtctt cgccctctga aattaaagaa atg 2913 



'<210> 17 
<211> 3695 
<212> DNA 
<213> Homo sapiens 



<400> 17 

ccgtttcctc 

ctcccgctcc 

acgagtgcgg 

cctgatcgtc 

gccgagcagc 

ttcggaatca 

aagcgccagc 

aatgaagagc 

gccaactgtg 

aaagctacct 

tgcagaggtt 

agctggcacc 

aaaaccaaga 

attggattaa 

cacagaatta 

tactcagctc 

tcacctactg 

attctgggac 

cagactccct 

cttccttcga 

tgtctttcaa 

tacctgcagc 

tcctggagct 

gggagtttaa 

agcccaagat 

tcatccacaa 

tgctggggtt 

actgaggtcg 

atcctgacca 

gccaagaagc 

tgtggcagca 

aagaggcacc 

gtttgagatt 

cttttgcctt 

gggaaggaaa 

attgttctca 



ccctcccctc 
tgaagagcgc 
tgtcgctcca 
tctggccggc 
caccgtcccg 
agaatatgga 
cagcctttga 
aaacactgca 
aattgccttt 
tcagtggctt 
cggcatgaat 
tgactttgtg 
aaagacagaa 
cagcaataca 
ccccaaaggc 
tgagcaggag 
ctctgtcagt 
gcccaaagac 
cctccagtcc 
gagcttcgaa 
ggattacatc 
tgtgctggcc 
gctatcagac 
gctcgccgac 
gaactacgag 
gacgtcgggg 
cacgcccgag 
ccgggaccac 
gctgctccga 
agtggcctta 
acggcacagc 
aggaagccgt 
ctcaaaggag 
ttgcaaccag 
caaccatgtc 
gaatccctta 



cactcggccg 
gccgcgtggg 
gctcagagct 
gccctcgccc 
accaagcgcc 
ccaggtagcc 
cacctttgat 
agaagtgcca 
gttaaccccg 
ctctgggcca 
ggccagatgc 
ggtgacattc 
gatcaatatg 
ttaggttttg 
ggcctcctgg 
tttcagatgt 
caggacttcc 
cacgactccc 
tggaacagcc 
gatgactgca 
caagagagga 
ggcttcacag 
aaatcctgcc 
cccgatgagg 
aagctgagcc 
aagcgctacg 
gaactgcacg 
cctgagccgg 
ggacccagga 
ttgcatccca 
taattctact 
cctggcgcct 
cgagcatgtc 
gaacagcaaa 
atttcagaag 
acagttgtat 



tccctccttc 
ggacggcccg 
cccggagccg 
tcgcccggcg 
ggccctgccc 
cctgtggcta 
gggtccctgt 
acaggcttgg 
tgcagcaagg 
ccaatgagtt 
tgtgtaacct 
tctgggaaca 
aagaaaattc 
gcacagagca 
acagcatgtg 
tccccaagtc 
caggcagcaa 
ctgagaacgg 
agtcgtcctt 
gccagtctct 
gtgacccggt 
gaagtggacc 
agtcattcat 
tggcccgccg 
ggggcttacg 
tgtaccgctt 
ccatcctggg 
ccccaggctc 
aaggcaggat 
aaccacgcct 
cacagtgctt 
ggcagtccgt 
gtggacacac 
tgcaaaaact 
ttagtttgta 
ttaacagaaa 



ctcctccctc 
gttacttcct 
cccggccagc 
cgcaccgagc 
gcagcggcag 
acagttacag 
ttgctgtttt 
attccatttc 
ctgtgatgag 
cagtctggtg 
tggcaaggaa 
tctggagcaa 
acacctcacc 
ggcgccctat 
tccggcctcc 
tcggctcagc 
cttgaatttg 
tgcggacagc 
gctggatgtg 
ctgcctcaat 
ggagcaaggc 
tattcagctg 
cagctggact 
gtggggaaag 
ctactattac 
cgtgtgcgac 
cgtccagccc 
gtggactgag 
tgaaaatgtc 
cttgaccagg 
ttaagtgaaa 
gggacgggat 
acagactatt 
ctttgagagg 
tatattataa 
ttgtatattg 



ctccctcctc 
ccagagactg 
gtccggcctc 
agccgcgggc 
gatgaatgat 
agggacactc 
tccttctcta 
tcatgactcc 
tcaagcctta 
aacgtgaatc 
cgctttctgg 
atgatcaaag 
tccgttcctc 
ggaatgcaga 
acacccagcg 
tccgtcagcg 
ctcaccaaca 
ttcgagagct 
caacgggttc 
aagccaacca 
aaaccagtta 
tggcagtttc 
ggagacggat 
aggaaaaata 
gacaagaaca 
ctccagaact 
gacacggagg 
tgggaagccc 
caggaaagtg 
ctgcctccct 
atggtcgaga 
ggttctggct 
tttagatttt 
gtaggagggt 
taatcttata 
taatttaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 
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taattatata 
aatagcacat 
taaagtatgt 
cctgcttttt 
ggttgtctgg 
tctgggctta 
gtatttttct 
atacacaaaa 
ctgaccaccc 
tattattttt 
atttttactg 
tttatctcag 
aggggtcttg 
tgtctcaacc 
gtgattctta 
gaaagagaaa 
ttgtagtcgt 
ggtcaagttg 
gggtggctga 
caagctggga 
agtatacata 
tttataatat 
tggctcattg 
tacaaagaag 
accattggca 
ctctcaataa 



actgtatttg 
atggaatttt 
attatttaca 
gtgttttttt 
caaggacttt 
tatctggcct 
tgacaaatgg 
tggattttat 
agccatttcc 
ggaaatatgc 
caaaagtaat 
ctctatgtaa 
atgaaacctc 
atcttaatag 
ttttcttgga 
gcatcggagc 
ttcaaaagag 
gtttcaaagt 
catctgtaaa 
agagcaaagc 
agcacaggat 
aaacagacat 
tatgtgaggt 
gagtttcatg 
tggtaataaa 
taaagtattt 



aaataagaat 
gcaaagattt 
tttaatagac 
gtttgtttgt 
gtacatttgg 
ctgctttctc 
catatgtttt 
tttttagttt 
ttcctgtgct 
aactcctttt 
tcactgggaa 
agttaaagtt 
ttgaacgaag 
catgctgctc 
cacagactat 
cattcattcg 
cacctgagtc 
ccgacaggct 
gaatcctcct 
cagagctgcg 
gtttttctca 
gtgactggga 
tgaaccacgt 
gggttcagcc 
cagatcttaa 
tgtttatata 



tcagacatct 
aatctgccaa 
ttacagggat 
ttgtttgttt 
gagtttttat 
ctttaattgt 
ccacttcttt 
aactgtgttt 
ccacgttctt 
cagagatcag 
aaaaaatgta 
actgtacaga 
cacagtttgt 
ctttttgctc 
tctgaggcac 
gagaaaacgt 
atgtgtattc 
tgtctgttta 
gtgatgaaac 
ctgcctcaat 
agagggatgt 
acatcttgct 
gaaattgcca 
taacagttat 
gtataaaaat 
aaaaa 



gaggttttat 
gggccgacta 
aaggcctgtg 
ttggggggtt 
gagaaactta 
aaagtaaaag 
gcatgcgttt 
ctccgacagc 
ctgtgtgatt 
gagggattta 
atttgtaaga 
gctgaaggac 
cccatctttg 
agtgtccaca 
agagcgggga 
tttgatcaaa 
ccggccttta 
ctagctgcgt 
tgaggaatcg 
acccacaaaa 
atttatcact 
gccaaaagaa 
atattaggct 
ggaaactaca 
tttgtaattg 



ttcatttttc 
agagaagttg 
gggggtaatc 
ttcttgcctt 
aatgttatta 
ctataaagca 
aagtcagttt 
tcacctctct 
aaaataagaa 
tgtagcagct 
aagctttatt 
ggggggcggt 
ttcactcgtg 
gcaagatgac 
cttaagatgg 
atggagactt 
taaatgaccc 
ggccttggac 
ggtggccggg 
gaccattccc 
tggacatctg 
tcctaggcag 
ggcttttatc 
gtccttataa 
ggcctttact 



2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640" 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3695 



<210> 18 

<211> 1535 

<212> DNA 

<213> Homo sapiens 

<400> 18 

attaattctg gctccacttg ttgctcggcc caggttgggg agaggacgga gggtggccgc 60 
agcgggttcc tgagtgaatt acccaggagg gactgagcac agcaccaact agagaggggt 120 
cagggggtgc gggactcgag cgagcaggaa ggaggcagcg cctggcacca gggctttgac 180 
tcaacagaat tgagacacgt ttgtaatcgc tggcgtgccc cgcgcacagg atcccagcga 240 
aaatcagatt tcctggtgag gttgcgtggg tggattaatt tggaaaaaga aactgcctat 300 
atcttgccat caaaaaactc acggaggaga agcgcagtca atcaacagta aacttaagag 360 
acccccgatg ctcccctggt ttaacttgta tgcttgaaaa ttatctgaga gggaataaac 420 
atcttttcct" tcttccctct ccagaagtcc attggaatat taagcccagg agttgctttg 480 
gggatggctg gaagtgcaat gtcttccaag ttcttcctag tggctttggc catatttttc 540 
tccttcgccc aggttgtaat tgaagccaat tcttggtggt cgctaggtat gaataaccct 600 
gttcagatgt cagaagtata tattatagga gcacagcctc tctgcagcca actggcagga 660 
ctttctcaag gacagaagaa actgtgccac ttgtatcagg accacat'gca gtacatcgga 720 
gaaggcgcga agacaggcat caaagaatgc cagtatcaat tccgacatcg acggtggaac 780 
tgcagcactg tggataacac ctctgttttt ggcagggtga tgcagatagg cagccgcgag 840 
acggccttca catacgccgt gagcgcagca ggggtggtga acgccatgag ccgggcgtgc 900 
cgcgagggcg agctgtccac ctgcggctgc agccgcgccg cgcgccccaa ggacctgccg 960 
cgggactggc tctggggcgg ctgcggcgac aacatcgact atggctaccg ctttgccaag 1020 
gagttcgtgg acgcccgcga gcgggagcgc atccacgcca agggctccta cgagagtgct 1080 
cgcatcctca tgaacctgca caacaacgag gccggccgca ggacggtgta caacctggct 1140 
gatgtggcct gcaagtgcca tggggtgtcc ggctcatgta gcctgaagac atgctggctg 1200 
cagctggcag acttccgcaa ggtgggtgat gccctgaagg agaagtacga cacactggtt 1260 
taaaaaacaa acatcgaaag gcttatgcca aatggaagat agaatataaa ataaaacgtt 1320 
acttgtatat tggtaagtgg tttcaattgt ccttcagata attcatgtgg agatttttgg 1380 
agaaaccatg acggatagtt taggatgact acatgtcaaa gtaataaaag agtggtgaat 1440 
tttaccaaaa ccaagctatt tggaagcttc aaaaggtttc tatatgtaat ggaacaaaag 1500 
gggaattctc ttttcctata tatgttcctt acaaa 1535 
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<210> 19 

<211> 3877 

<212> DNA 

<213> Homo sapiens 



<400> 19 

cattagatct 

ctatttctta 

cacaaacatt 

aaatattaat 

aagctttaca 

aacagaacct 

acatcattag 

ataaaaataa 

ttgtaccgtg 

ttataaactt 

cctcctccct 

ctgaggcgga 

ctcgggctgc 

aagctgacag 

gagcctgaca 

atcccagtca 

gacgccggcc 

gtgtggctca 

ccacccaatg 

attgtctggt 

aatgtaacag 

ctggcctctt 

atcaccgtga 

ggccgagctc 

gtcctggctg 

gccaccaact 

gactgggtgc 

gccatccgca 

ttggaaggcc 

gagctgacag 

atcgtacgtg 

gtctcttctc 

cctgtggtcc 

cttcgaaaga 

ggagagccag 

atcgaggcca 

gtgctcatcc 

tcagtgcggg 

atgctgaaag 

tgcatgaagg 

agggctaaag 

ctgcatgcct 

accctgatcc 

ttcatccacc 

gtggctgact 

ggagggctgg 

acacagaggc 

ggcagagggc 

cccgtctgac 

tgtgctggga 

gtggccagcc 

ctgaacccag 

gatggcagtg 

cctgctgctt 



ttacatgaaa 
ggatactaaa 
aaaggctccc 
agtttgcaat 
actcaaaaaa 
tcacaaaaga 
ccatgatgga 
aaggactgag 
ctcctgaggg 
aatgccctgg 
cctcgctcgc 
gcatggggcg 
tgctggcgga 
tgtctcaggg 
tccagtgggt 
gcgagcagca 
ggtactggtg 
cggtagaagg 
cccctttcca 
ggagaggaac 
gggtgaccca 
ctcgcacagc 
caaagctttc 
tgctacagtc 
ttgtggtccc 
acagcctcag 
cctttcagac 
cagattcagg 
ccctgggacc 
tggaggggac 
tgtgcgtctc 
atgaccgtgc 
ttggtgtgct 
gacggaaaga 
ccgttcactt 
cattggacag 
cagagcagca 
aggcccagct 
ctgacatcat 
agtttgacca 
gccgtctccc 
tcctgctcgc 
ggttcatggt 
gagacctggc 
tcggactctc 
ctgagcagcc 
ttttgctgct 
atcggggcca 
cccagcccag 
agcccggact 
tggcatcagt 
gcagctggca 
tggggagggc 
aagtgcatgc 



gtaaaattta 
agcactcaca 
tataaaaaaa 
acatatgtat 
tacaaagaaa 
agataagaat 
aatgcaaatt 
ctcaccaaac 
tataacatat 
caacctcaca 
gggccgggcc 
gccggggctc 
gtccgccgcc 
gcagccggtg 
gaaggatggg 
ctggatcggc 
ccaggtggag 
tgtgccattt 
actgtcttgt 
tacgaagatc 
gagcaccatg 
cactgttcac 
cagcagcaac 
ctgtacagtt 
tgtgcccccc 
ggtgcgctgt 
caagggtcta 
cctcatcttg 
ctacaaactg 
cagggccaat 
caatgcagtt 
aggccagcag 
aacggccctg 
gacgcggttt 
ccgggcagcc 
cttgggcatc 
gttcaccctg 
gaagcaagag 
tgcctcaagc 
tccacacgtg 
catccccatg 
ctcccggatt 
ggacattgcc 
tgctcggaat 
ccggaagatc 
agggcaggca 
gcagcaaggg 
tttggccggc 
acagcaaggt 
gaccaaatca 
ttaggccttg 
ggagtggggt 
aggtccagct 
attgagctgc 



taagatttct 
ttatagaaaa 
catttttaat 
gaaaaggaat 
atatttttct 
gtttaataaa 
taagtaccac 
attggtgagg 
tacaggattt 
cctatttact 
cggcatggtg 
ccgccgctgc 
gcaggtctga 
aagctcaact 
gctgtggtcc 
ttcctcagcc 
gatgggggtg 
ttcacagtgg 
gaggctgtgg 
gggggacccg 
ttttcctgtg 
cttcaagcac 
gctagtgtgg 
caggtgacac 
tttacctgcc 
gccaatgcct 
gccccagcca 
gagtgggaag 
tcctgggttc 
ttgacaggct 
ggctgtggac 
ggccctcctc 
gtgacggctg 
gggcaagcct 
cggtccttca 
agcgatgaac 
ggccggatgt 
gatggctcct 
gacattgaag 
gccaaacttg 
gtcatcttgc 
ggggagaacc 
tgcggcatgg 
tgcatgctgg 
tacagtgact 
gagcaccagc 
ctactgccac 
tctggtggcc 
gtggaggctc 
cccaatccca 
gcttgatgga 
ggttatgttt 
ctgtgggccc 
ctccagcctg 



agaaagtcaa 
aaaatcagtt 
aggcaagcca 
tgaatctaga 
tccaattggc 
catttgaagc 
ttcacatcca 
atgtggtaat 
ttttgaaaac 
taagaatgaa 
cggcgtcgcc 
cgctgccgcc 
agctcatggg 
gcagtgtgga 
agaacttgga 
tgaagtcagt 
aaaccgagat 
agccaaaaga 
gtccccctga 
ctccctctcc 
aagctcacaa 
tgcctgcagc 
cctggatgcc 
aggccccagg 
tgctccggga 
tggggccctc 
gcgctcccca 
aagtgatccc 
aagacaatgg 
gggatcccca 
cctggagtca 
acagccgcac 
ctgccctggc 
ttgacagtgt 
atcgagaaag 
taaaggaaaa 
tgggcaaagg 
ttgtgaaagt 
agttcctcag 
ttggggtaag 
ccttcatgaa 
cctttaacct 
agtacctgag 
cagaggacat 
gtcggtacat 
cagagagtcc 
acagtagctg 
actgagctgg 
ctgtggtagt 
gttcttcctg 
agtgggccag 
ccatggttac 
taccctcctg 
gtggcccagc 



aagatgataa 
aactatactc 
cagaaagggc 
atatttaaca 
aaattactta 
cataataatg 
caagaaaaag 
actgaaattc 
tagtggttcc 
agggccccgc 
gccgatggcg 
gccaccgcgg 
agccccggtg 
ggggatggag 
ccagttgtac 
ggagcgctct 
ctcccagcca 
tctggcagtg 
acctgttacc 
atctgtttta 
cctaaaaggc 
ccccttcaac 
aggtgctgat 
aggctgggaa 
cctggtgcct 
tccctatgct 
aaacctccat 
cgaggcccct 
aacccaggat 
aaaggacctg 
gccactggtg 
atcctgggta 
cctcatcctg 
catggcccgg 
gcccgagcgc 
actggaggat 
agagtttggt 
ggctgtgaag 
ggaagcagct 
cctccggagc 
gcatggggac 
acccctccag 
ctctcggaac 
gacagtgtgt 
actcaccccc 
cctcaatgag 
ttagcccaca 
ctgactaagc 
cctcccaagc 
caaccactct 
tcctggttgt 
catgggtgtg 
ctgagctgcc 
tattaccaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 
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cttggggttt 
tcccttttag 
tctgattccg 
gaccgggtca 
gcagcgtgcc 
gccggagagg 
caaaatatcc 
cctttagtgc 
aaatcaggcc 
tctggaatgc 
ttaaaataga 



aaatatccag 
gtgagggttg 
caccctgcct 
cacaaaggca 
gagccagcaa 
agttcaggaa 
taagactaac 
caacttcccc 
tgatgagggg 
atggggcggg 
aataaaattg 



gtgtgcccct 
gtaaggggtt 
acgccaggag 
tgctggagta 
gaggaagggg 
cccttctcca 
aaaggcagct 
tctaactgga 
gaattcctgg 
t cctagctgt 
aagactaaag 



ccaagtcaca 
ggtatctcag 
aagttgaggg 
cccagcctat 
tgctgtgagg 
tacccacaat 
gtgtctgagc 
cagcctcttc 
aacctggacc 
tagggacatt 
acctaaa 



aagagatgtc 
gtctgaatct 
gagcatgctt 
caggtgcccc 
cttgcccagg 
ctgagcacgc 
ccaacccttc 
tgtcccaagt 
ccagccttgg 
tccaagctgt 



cttgtaatat 
tcaccatctt 
ccctgcagct 
tcttccaaag 
agcaagtgag 
taccaaatct 
taaacggtga 
ctccagagag 

tagttgctgt 



3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3877 



<210> 20 

<211> 3532 

<212> DNA 

<213> Homo sapiens 

<400> 20 

ttcccctccc ctccccaccc ccttttttcg ctgctctccc gcttccccgc caatggagaa 60 

gcgagctgat gacaaatagc gggccgcgga gtccgcggga cttcgcacca ggagtaataa 120 

aacagaccca gagatcaagg agctggggag ggggcggggg aacagggagg gagagcgtgt 180 

gagcgtctgc gagtgtgtgg aggcggctgc tgtggcagcg caggcggctc ggctccggcc 240 

cggagcgcag cggaagccgc gagggatgca gcggcgggga ccttggccgg tggaggatgt 300 

ggaggtggaa gtggagcgga tggcgctccc caagagctcc gccacgcgag gtttcgggct 360 

cgtggttttg cttcctccgg ccaaggtccc cgccgctgca tccctcgcgg cttcccgtgc 420 

gctccgggcc ggagccgagc cgcctgcgct gccacagcca gcgcctccac acactcgcag 480 

acgctcacac gctctccctc cctgttcccc cgccccctcc ccagctcctt gatctctggg 540 

tctgttttat tactcctggt gcgagtcccg cggactccgg cccgctattt gtcatcagct 600 

cgctctccat tggcggggag cggagagcag cgaagaaggg ggtggggagg ggaggggaag 660 

ggaagggggt ggaaactgcc tggagccgtt tctcgcgccg ctgttggtgc tgccgctgcc 720 

tcctcctcct ccgccgccgc cgccgccgcc gccgccgcct cctccggctc ttcgctcggc 780 

ccctctccgc ctccatgtgc cggatagcgg gagcgctgcg gaccctgctg ccgctgctgg 840 

cggccctgct tcaggcgtct gtagaggctt ctggtgaaat cgcattatgc aagactggat 900 

ttcctgaaga tgtttacagt gcagtcttat cgaaggatgt . gcatgaagga cagcctcttc 960 

tcaatgtgaa gtttagcaac tgcaatggaa aaagaaaagt acaatatgag agcagtgagc 1020 

ctgcagattt taaggtggat gaagatggca tggtgtatgc cgtgagaagc tttccactct 1080 

cttctgagca tgccaagttc ctgatatatg cccaagacaa agagacccag gaaaagtggc 1140 

aagtggcagt aaaattgagc ctgaagccaa ccttaactga ggagtcagtg aaggagtcag 1200 

cagaagttga agaaatagtg ttcccaagac aattcagtaa gcacagtggc cacctacaaa 1260 

ggcagaagag agactgggtc atccctccaa tcaacttgcc agaaaactcc aggggacctt 1320 

ttcctcaaga gcttgtcagg atcaggtctg atagagataa aaacctttca ctgcggtaca 1380 

gtgtaactgg gccaggagct gaccagcctc caactggtat cttcattatc aaccccatct 1440 

cgggtcagct gtcggtgaca aagcccctgg atcgcgagca gatagcccgg tttcatttga 1500 

gggcacatgc agtagatatt aatggaaatc aagtggagaa ccccattgac attgtcatca 1560 

atgttattga catgaatgac aacagacctg agttcttaca ccaggtttgg aatgggacag 1620 

ttcctgaggg atcaaagcct ggaacatatg tgatgaccgt aacagcaatt gatgctgacg 1680 

atcccaatgc cctcaatggg atgttgaggt acagaatcgt gtctcaggct ccaagcaccc 1740 

cttcacccaa catgtttaca atcaacaatg agactggtga catcatcaca gtggcagctg 1800 

gacttgatcg agaaaaagtg caacagtata cgttaataat tcaagctaca gacatggaag 1860 

gcaatcccac atatggcctt tcaaacacag ccacggccgt catcacagtg acagatgtca 1920 

atgacaatcc tccagagttt actgccatga cgttttatgg tgaagttcct gagaacaggg 1980 

tagacatcat agtagctaat ctaactgtga ccgataagga tcaaccccat acaccagcct 2040 

ggaacgcagt gtacagaatc agtggcggag atcctactgg acggttcgcc atccagaccg 2100 

acccaaacag caacgacggg ttagtcaccg tggtcaaacc aatcgacttt gaaacaaata 2160 

ggatgtttgt ccttactgtt gctgcagaaa atcaagtgcc attagccaag ggaattcagc 2220 

acccgcctca gtcaactgca accgtgtctg ttacagttat tgacgtaaat gaaaaccctt 2280 

attttgcccc caatcctaag atcattcgcc aagaagaagg gcttcatgcc ggtaccatgt 2340 

tgacaacatt cactgctcag gacccagatc gatatatgca gcaaaatatt agatacacta 2400 

aattatctga tcctgccaat tggctaaaaa tagatcctgt gaatggacaa ataactacaa 2460 

ttgctgtttt ggaccgagaa tcaccaaatg tgaaaaacaa tatatataat gctactttcc 2520 
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ttgcttctga 
ttgatattaa 
cagaccccaa 
catttgcttt 
ggcttaatgg 
atgaagttcc 
tgcgcgtgaa 
tgggtgcggg 
ttatccttgt 
aacaactttt 
aaggtggagg 
tggagcctga 
ccgagcccca 
ttaatgagaa 
atgatttcag 
acatcagaga 
cctatgtact 



caatggaatt 
tgacaatgcc 
ttcaattaat 
tgatcttcct 
tgattttgct 
catcataatc 
ggtttgccag 
gcttggcacc 
gctgatgttt 
aattgatcca 
agaagaagac 
tgccatcaag 
gtatccggtc 
aacctggccc 
caatgcatct 
agagtgccag 
tgcattttca 



cctcctatga 
cctcaagtgt 
attacagcac 
ttatctccag 
cagcttaatt 
acagattcgg 
tgtgactcca 
ggtgccatca 
gtggtatgga 
gaagatgatg 
caggactatg 
cctgtgggaa 
cgatctgcag 
atacagagct 
atcatgcctg 
ggtcaacaat 
gcaatattgc 



gtggaacagg 
tacctcaaga 
ttgattatga 
tgactattaa 
taaagataaa 
gtaatcctcc 
acggggactg 
ttgccatcct 
tgaaacgccg 
taagagataa 
acttgagcca 
tccgacgaat 
ccccacaccc 
tacacctatg 
cctttggacc 
taatatttta 
agcatagtat 



aacgctgcag 
ggcagagact 
cattgatcca 
gagaaattgg 
atttcttgaa 
caaatcaaat 
cacagatgtg 
gctctgcatc 
ggataaagaa 
tattttaaaa 
gctgcagcag 
ggatgaaaga 
tggagacatt 
accttggctt 
taatgagtat 
gagttacaac 
tatttatctc 



atctatttac 
tgcgaaactc 
aatgctggac 
accatcactc 
gctggtatct 
atttccatcc 
gacaggattg 
atcatcctgc 
cgccaggcca 
tatgatgaag 
cctgacactg 
cccatccacg 
ggggacttca 
cgtgggcacc 
gaaccacatt 
tacatgtgaa 
ta 



2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3532 



<2X0> 21 

<211> 3069 

<212> DNA 

<213> Homo sapiens 



<400> 21 

agattatgat 

ttcctttttt 

aagtccaaac 

gaggccccgg 

tctgctggag 

ctcttcattc 

gagccgggca 

acaeaatgag 

aggcatgaag 

gtttagaacg 

ttggtttgct 

gccagcacca 

agctgcaggg 

cttcagatcg 

tctcccatgg 

gcagcctgcc 

tcacttcata 

caatacaaac 

aacaaaacaa 

agcaaaaagt 

aatcacaatt 

gagaagagga 

gagtagacac 

aaaataaaaa 

taatgagcta 

ctttattatt 

ctcccatgaa 

ctaagcaagc 

tcctgctgcc 

cctatgagta 

tgggtcagaa 

ttatttcaga 

gcttatccat 

catctatttt 

gcacaaaatg 

gaatcattac 



cgcctgaggc 
tttttttttt 
agccaggggt 
tcgccacccg 
gctgccgagt 
ccgtccatgc 
cagaaacaca 
ctggaaaaga 
atgcttttga 
cttaaaagtt 
caacaaagcc 
gctcgagaat 
tcctcaggag 
ttctgattca 
agaagtggac 
gagtattggg 
gaacccagca 
aatctcttaa 
aactatactt 
ggggcagagc 
tttaatggca 
cattggagat 
atttaaggat 
ctattcgaat 
ttaaaactca 
attttttttt 
gccttttgaa 
taaccttatc 
cttaggccac 
tggattgggg 
gagttgtgca 
ttttgataac 
aaaaaaaaat 
gatgcagcat 
aatgaggtct 
cttttaccag 



ccctctccta 
tttaagtaat 
aaggtctcca 
cggtgcccat 
ttttggagcg 
cgagcccccg 
gcagcgggag 
atcgcttgaa 
aggctattgc 
ctgattccac 
aaagcacaca 
ttggaacgag 
atggaacgaa 
gagcgagagg 
aatataagta 
agtgacgagg 
tgacataaca 
attgggttca 
gaacaaaagg 
tcccaaggag 
gcagaaaact 
gccatcctct 
ggggttatga 
gaaaagacaa 
gcctgggaca 
ttttgaaaaa 
agggatcatc 
ctgcattgtt 
atgggagcag 
gggccaaaag 
cgcagattag 
tgtttatatg 
agatgtcttt 
ttgataatga 
gggctaggta 
cttttaacca 



cccagatacc 
taagggtagt 
agaggccttc 
ggagcgggtg 
ccgggagcga 
actgcagcat 
cagcaacacc 
gaattggttg 
agtaatctag 
taggaccaga 
tcaagaaact 
aacagagatt 
tacgaatgga 
agattgaagt 
ccaccagcat 
gttactccag 
gtgcagggca 
tgatgcagtc 
gtcagaggac 
aacaaatatt 
tgtgtgaaat 
ttctcttttc 
acccttcctg 
gaaaatcagg 
gtttatcatg 
agctcatttc 
atgcagctca 
agcactaggc 
tccatgcatg 
gaaaaagctc 
caggccaagg 
tgttgaaaac 
tatagtggaa 
taaaacacct 
gaaaaagggt 
tctgatatct 



gatgttatac 
taaattattt 
ccagggtaag 
aagatgatca 
gagtgtgaac 
tcaaagcccc 
agcactgcca 
gttgggagaa 
acgagctcat 
ctgcacccgg 
tgaagaagct 
tttaaagtgg 
cagcattgga 
ggatgttgaa 
cagtgacatt 
tgccagtgtc 
aaatattcac 
tcctctttaa 
ctgtatttaa 
cagaatattc 
tttcttgatt 
tagtttgctc 
agctttatgg 
tattaatctt 
aagcctgtgg 
atgctctgca 
actttctgtt 
acccagctgc 
acagcctcta 
catgtgcctc 
tctgagccac 
caaaatgaca 
aaacacatgg 
cacacctcac 
caatgctatt 
atagtagaca 



tgatgtgttt 
aaagtataca 
ggagtgcgga 
acgtgcagcg 
atggctacgc 
cacggaggtt 
acagatctac 
gggataccag 
ctgcgccttt 
cacacaacac 
gaaagaaaaa 
cgactggaac 
tcaactattt 
agcacagagt 
gatgaccaca 
aaactttcat 
tgggccaatt 
aacaaaacaa 
gcaaatactt 
atattggaaa 
tgagttgatt 
atactacatt 
tcctaaaagc 
ggatagctaa 
atgatcaatc 
aaaggagaga 
ggattccatg 
cacctctcca 
tcctacaagg 
tttgtctgcg 
agcagcattt 
tctttttaaa 
ggaaaaaaat 
tctttatagt 
tttgttttta 
cactatcata 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 
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gttaacatag 
ctacaggcag 
gaatattttc 
atgtgttccc 
gaaggcctct 
actaccctta 
gattttaatg 
gcaaggaaaa 
gttagactat 
accatcttgg 
gaagtttttc 
tgaaaaactg 
gacttaatga 
tttcacttat 
atcaagtgtg 
aagaaacaa 



ttaagttcag 
tctctctctt 
agtaatgtta 
tttgtctttc 
tggagacctt 
attacttaaa 
gattgttaat 
agaaaataaa 
ccctttgtga 
tcatacatta 
cttgattgtg 
catgccttta 
aaaacaggac 
aattactaat 
agacagtttg 



cacttgtctc 
cctcacagtc 
ttttcttcta 
aaactccaag 
acccctggct 
aaaaaaaaaa 
ttcagtcctg 
gacaaccata 
gtgacacttt 
cgcctcaaca 
ctatgtttca 
gaagcccagt 
aaaacaattc 
ggctgagtca 
aagactgtgc 



attttaatgt 
ccactgtgca 
agtgaaattt 
gttcccctgt 
gtttggactt 
aaagctttat 
tagttttatt 
tttagcagtg 
aacagcattc 
tatacttgtg 
gtggaagaaa 
atcagaactt 
ctttttgtgg 
agatgttgtc 
taccatacaa 



aaagatttgc 
ggtgctattg 
ctagcctgca 
ggccctctcc 
tgtatacttt 
gattttcata 
ttatgtttag 
cagttgagtt 
actgcttcta 
ctcttccttt 
ttctttgaag 
gctacgtttc 
cccaggtaaa 
tctgtgtttg 
agtgaatgaa 



ttccattttc 
ttactcttac 
ctttgatgtc 
cttaccctgg 
aaataattta 
acttattgct 
atagggctgg 
gtgtgttaat 
tatatagtgt 
gcctccagaa 
tagatgtgag 
aggtgctagg 
ttatttctgg 
cttactcttg 
gccagtgact 



2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3069 



<210> 22 

<211> 2971 

<212> DNA 

<213> Homo sapiens 

<400> 22 

cggccgcgca aggaggcgcg ctgcgagggc gcggggctgg cccccgccgc gcccccggct 60 

gtgccccccg ccgtggccgc gccccagccc ccggccctgc ccgaggaccc cgctggggcc 120 

aagcccaggt gccccttctc agacattttc aacaccagcg agaactcgat ggagaagcac 180 

atcaacactt ttctgcagaa cgtgcagatt ctgctcgagg ccgccagcta cctggagcag 240 

atcgagaaag aaaacaaaaa gtgtgaacat ggctacgcct cttcattccc gtccatgccg 300 

agcccccgac tgcagcattc aaagccccca cggaggttga gccgggcaca gaaacacagc 360 

agcgggagca gcaacaccag cactgccaac agatctacac acaatgagct ggaaaagaat 420 

cgacgagctc atctgcgcct ttgtttagaa cgcttaaaag ttctgattcc actaggacca 480 

gactgcaccc ggcacacaac acttggtttg ctcaacaaag ccaaagcaca catcaagaaa 540 

cttgaagaag ctgaaagaaa aagccagcac cagctcgaga atttggaacg agaacagaga 600 

tttttaaagt ggcgactgga acagctgcag ggtcctcagg agatggaacg aatacgaatg 660 

gacagcattg gatcaactat ttcttcagat cgttctgatt cagagcgaga ggagattgaa 720 

gtggatgttg aaagcacaga gttctcccat ggagaagtgg acaatataag taccaccagc 780 

atcagtgaca ttgatgacca cagcagcctg ccgagtattg ggagtgacga gggttactcc 840 

agtgccagtg tcaaactttc attcacttca tagaacccag catgacataa cagtgcaggg 900 

caaaatattc actgggccaa ttcaatacaa acaatctctt aaattgggtt catgatgcag 960 

tctcctcttt aaaacaaaac aaaacaaaac aaaactatac ttgaacaaaa gggtcagagg 1020 

acctgtattt aagcaaatac ttagcaaaaa gtggggcaga gctcccaagg agaacaaata 1080 

ttcagaatat tcatattgga aaaatcacaa tttttaatgg cagcagaaaa cttgtgtgaa 1140 

attttcttga tttgagttga ttgagaagag gacattggag atgccatcct ctttctcttt 1200 

tctagtttgc tcatactaca ttgagtagac acatttaagg atggggttat gaacccttcc 1260 

tgagctttat ggtcctaaaa gcaaaataaa aactattcga atgaaaagac aagaaaatca 1320 

ggtattaatc ttggatagct aataatgagc tattaaaact cagcctggga cagtttatca 1380 

tgaagcctgt ggatgatcaa tcctttatta ttattttttt ttttttgaaa aaagctcatt 1440 

tcatgctctg caaaaggaga gactcccatg aagccttttg aaagggatca tcatgcagct 1500 

caactttctg ttggattcca tgctaagcaa gctaacctta tcctgcattg ttagcactag 1560 

gcacccagct gccacctctc catcctgctg cccttaggcc acatgggagc agtccatgca 1620 

tgacagcctc tatcctacaa ggcctatgag tatggattgg gggggccaaa aggaaaaagc 1680 

tccatgtgcc tctttgtctg cgtgggtcag aagagttgtg cacgcagatt agcaggccaa 1740 

ggtctgagcc acagcagcat ttttatttca gattttgata actgtttata tgtgttgaaa 1800 

accaaaatga catcttttta aagcttatcc ataaaaaaaa atagatgtct tttatagtgg 1860 

aaaaacacat ggggaaaaaa atcatctatt ttgatgcagc atttgataat gataaaacac 1920 

ctcacacctc actctttata gtgcacaaaa tgaatgaggt ctgggctagg tagaaaaagg 1980 

gtcaatgcta tttttgtttt tagaatcatt accttttacc agcttttaac catctgatat 2040 

ctatagtaga cacactatca tagttaacat agttaagttc agcacttgtc tcattttaat 2100 

gtaaagattt gcttccattt tcctacaggc agtctctctc ttcctcacag tcccactgtg 2160 

caggtgctat tgttactctt acgaatattt tcagtaatgt tattttcttc taagtgaaat 2220 



17 



ttctagcctg 
gtggccctct 
tttgtatact 
atgattttca 
ttttatgttt 
tgcagttgag 
tcactgcttc 
tgctcttcct 
aattctttga 
ttgctacgtt 
ggcccaggta 
tctctgtgtt 
aaagtgaatg 



cactttgatg 
cccttaccct 
ttaaataatt 
taacttattg 
agatagggct 
ttgtgtgtta 
tatatatagt 
ttgcctccag 
agtagatgtg 
tcaggtgcta 
aattatttct 
tgcttactct 
aagccagtga 



tcatgtgttc 
gggaaggcct 
taactaccct 
ctgattttaa 
gggcaaggaa 
atgttagact 
gtaccatctt 
aagaagtttt 
agtgaaaaac 
gggacttaat 
ggtttcactt 
tgatcaagtg 
ctaagaaaca 



cctttgtctt 
cttggagacc 
taattactta 
tggattgtta 
aaagaaaata 
atccctttgt 
ggtcatacat 
tccttgattg 
tgcatgcctt 
gaaaaacagg 
ataattacta 
tgagacagtt 
a 



tcaaactcca 
ttacccctgg 
aaaaaaaaaa 
atttcagtcc 
aagacaacca 
gagtgacact 
tacgcctcaa 
tgctatgttt 
tagaagccca 
acaaaacaat 
atggctgagt 
tgaagactgt 



aggttcccct 
ctgtttggac 
aaaaagcttt 
tgtagtttta 
tatttagcag 
ttaacagcat 
catatacttg 
cagtggaaga 
gtatcagaac 
tcctttttgt 
caagatgttg 
gctaccatac 



2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
2971 



<210> 23 
<211> 1394 
<212> DNA 
<213> Homo 

<400> 23 

actcactata 

tcggccccgs 

gcagcatgga 

ccaagaggaa 

cacccccccg 

tggaggctga 

agaaggtgcg 

tgaactcaca 

actgtttcta 

gcatggagct 

tgacaattcc 

atctgcacag 

acaaggaggg 

tggccaagac 

agctgaacca 

ttgagatggc 

agcaggtggt 

tggacttcac 

tgattggagc 

tctgctcagg 

ggagccttcc 

gtgttgtttt 

ctttccctga 

caatggatct 



sapiens 



gggctcgagc 
tsgagyccry 
gtcgcccgcc 
gaaggatcta 
gaacctggac 
tgacttggtg 
gcacgcccag 
ggagcagaag 
cactgtcacc 
catggacaca 
agaggacatc 
caagctgtcg 
ccatgtgaag 
gatggatgcc 
gaagggctac 
catcctgcgg 
ggaggagccg 
tgctcagtgc 
tgatcgcttc 
ctggggtcca 
agctcactct 
taaaaaaaga 
tgggttgggg 
cgag 



ggccgcccgg 
wgtsctctaa 
tcgagccagc 
cggatatcct 
tcccggacct 
accatctcag 
agcggcacca 
cggctgctca 
ttctacgggg 
tccttggaca 
cttggggaga 
gtgatccaca 
atgtgtgact 
ggctgcaagc 
aatgtcaagt 
ttcccttacg 
tccccccagc 
ctgaggaaga 
tcccccaccc 
gtgggagggg 
ccctgaggac 
aaatatattt 
cagttacctg 



gcaggtgtct 
wkagtctcca 
ccgccagcat 
gcatgtccaa 
tcatcaccat 
aactgggccg 
tcatggccgt 
tggacctgga 
cactattcag 
agttctaccg 
ttgctgtgtc 
gagatgtgaa 
ttggcatcag 
cctacatggc 
ccgacgtctg 
agtcctgggg 
tcccagccga 
accccgcaga 
cctagggtac 
cccaagatct 
tggcttgaca 
ttttgaaaaa 
gttgctgttt 



gcccgcagcc 
ccgccgtcca 
gccccagtcc 
gccacccgca 
tggagacaga 
tggagcctat 
gaagcggatc 
catcaacatg 
agagggagac 
gaaggtgctg 
tatcgtgcgg 
gccctccaat 
tggctacttg 
ccctgagagg 
gagcctgggc 
gaccccgttc 
ccgtttctcc 
gcgtatgagc 
cagcaggcag 
ctgctcagag 
ggggctatgg 
acgactgccc 
taattaaaaa 



atgagcgtgc 
ggacccactt 
aaaggaaaat 
cccaacccca 
aactttgagg 
ggggtggtag 
cgggccaccg 
cgcacggtcg 
gtgtggatct 
gataaaaaca 
gccctggagc 
gtccttatca 
gtggactctg 
atcaacccag 
atcaccatga 
cagcagctga 
cccgagtttg 
tacctggagc 
agccttgccc 
aagtgcaggg 
gtttgctttg 
atcccgggtc 
ctttagagca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1394 



<210> 24 

<211> 1717 

<212> DNA 

<213> Homo sapiens 



<400> 24 

tcccttggcc 

ctgggtgaga 

aatacagcaa 

gagaaaggcg 

agaaagcagg 

gggaggcggg 

agaggcgcga 

gcccgagcct 



ccaggagaca 
gagcaagacc 
aaagatttgc 
aatttcttcc 
gttcggacgt 
gctggaggaa 
ctgggcttgc 
cgcgcgcttg 



ggggttgcag 
ctgtctcaac 
tttctcggct 
agacaccaat 
cattgggagg 
acgctcgttg 
gcagtgtcga 
cgcactttag 



aaagccgaga 
aaaaaatttt 
tcagtgtggg 
cccggaggtc 
cgaggctaga 
ctaaggaacg 
cgccggcgcc 
ccagcgcagg 



tcgtgccact 
taaaaaataa 
cggtaactcc 
gcttctgttg 
gcggggttgt 
cagcgctctt 
ggcgcgccgg 
gcgcaccccg 



gcactccatc 
aataaataat 
atcgtgcaat 
ctaggctccc 
gtgtggcgga 
cccgctctgg 
ggtttgaaag 
ctccctccca 



60 

120 

180 

240 

300 

360 

420 

480 



18 



ctctccctgc ccctcggacc ccatactcta cctcatcctt ctggccaggc gaagcccacg 540 
acgttgacat gccggagatc cgcctccgcc atgtcgtgtc ctgcagcagc caggactcga 600 
ctcactgtgc agaaaatctt ctcaaggcag acacttaccg aaaatggcgg gcagccaagg 660 
caggcgagaa gaccatctct gtggtcctac agttggagaa ggaggagcag atacacagtg 720 
tggacattgg gaatgatggc tcagctttcg tggaggtgct ggtgggcagt tcagctggag 780 
gcgctgggga gcaagactat gaggtccttc tggtcacctc atctttcatg tccccttccg 840 
agagccgcag tggctcaaac cccaaccgcg ttcgcatgtt tgggcctgac aagctggtcc 900 
gggcagccgc cgagaagcgc tgggaccggg tcaaaattgt ttgcagccag ccctacagca 960 
aggactcccc ctttggcttg agttttgtac ggtttcatag ccccccagac aaagatgagg 1020 
cagaggcccc gtcccagaag gtgacagtga ccaagcttgg ccagttccgt gtgaaggagg 1080 
aggatgagag cgccaactct ctgaggccgg gggctctctt cttcagccgg atcaacaaga 1140 
catccccagt cacagccagc gacccggcag gacctagcta tgcagctgct accctccagg 1200 
cttctagtgc tgcctcctca gcctctccag tctccagggc cataggcagc acctccaagc 1260 
cccaggagtc tccctggcat tcgttcgtcc ccgatggatc tacagttgca atgagaagca 1320 
gaagttactt cctcaccagc tctatggggt ggtgccgcaa gcctgaagta tgtgctatac 1380 
acacacacac acacacacac acacacacac acacacgatg catttaataa agatgagttg 1440 
gttctcatcc aagagtctcc caaaactcta agaggctccc tgggacctgg ggaagaatgc 1500 
tgggcacctc cgtcagagat ctggtacaca aggaactctt tgtctcttct gcttggcccc 1560 
ttatccctgt gttggcaaga ggcagggaac tgggaatctg accctcagca ctgcccctca 1620 
actttttctg gccctctgag ccacacctgt atcttggctg tccctttgtg gctggannnt 1680 
gggtacccat gaggcttgtc tctctcctga agcctca 1717 



<210> 25 

<211> 1346 

<212> DNA 

<213> Homo sapiens 

<400> 25 

tcccttggcc ccaggagaca ggggttgcag aaagccgaga tcgtgccact gcactccatc 60 

ctgggtgaga gagcaagacc ctgtctcaac aaaaaatttt taaaaaataa aataaataat 120 

aatacagcaa aaagatttgc tttctcggct tcagtgtggg cggtaactcc atcgtgcaat 180 

gagaaaggcg aatttcttcc agacaccaat cccggaggtc gcttctgttg ctaggctccc 240 

agaaagcagg gttcggacgt cattgggagg cgaggctaga gcggggttgt gtgtggcgga 300 

gggaggcggg gctggaggaa acgctcgttg ctaaggaacg cagcgctctt cccgctctgg 360 

agaggcgcga ctgggcttgc gcagtgtcga cgccggcgcc ggcgcgccgg ggtttgaaag 420 

gcccgagcct cgcgcgcttg cgcactttag ccagcgcagg gcgcaccccg ctccctccca 48 0 

ctctccctgc ccctcggacc ccatactcta cctcatcctt ctggccaggc gaagcccacg 540 

acgttgacat gccggagatc cgcctccgcc atgtcgtgtc ctgcagcagc caggactcga 600 

ctcactgtgc agaaaatctt ctcaaggcag acacttaccg aaaatggcgg gcagccaagg 660 

caggcgagaa gaccatctct gtggtcctac agttggagaa ggaggagcag atacacagtg 720 

tggacattgg gaatgatggc tcagctttcg tggaggtgct ggtgggcagt tcagctggag 780 

gcgctgggga gcaagactat gaggtccttc tggtcacctc atctttcatg tccccttccg 840 

agagccgcag tggctcaaac cccaaccgcg ttcgcatgtt tgggcctgac aagctggtcc 900 

gggcagccgc cgagaagcgc tgggaccggg tcaaaattgt ttgcagccag ccctacagca 960 

aggactcccc ctttggcttg agttttgtac ggtttcatag ccccccagac aaagatgagg 1020 

cagaggcccc gtcccagaag gtgacagtga ccaagcttgg ccagttccgt gtgaaggagg 1080 

aggatgagag cgccaactct ctgaggccgg gggctctctt cttcagccgg atcaacaaga 1140 

catccccagt cacagccagc gacccggcag gacctagcta tgcagctgct accctccagg 1200 

cttctagtgc tgcctcctca gcctctccag tctccagggc cataggcagc acctccaagc 1260 

cccaggagtc atcagacttt ggtggggtgg aggaggagag aagctggagg cctcaatcca 1320 

tccccatccc ctcagcccca gggagt 1346 



<210> 26 

<211> 1643 

<212> DNA 

<213> Homo sapiens 

<400> 26 

tcccttggcc ccaggagaca ggggttgcag aaagccgaga tcgtgccact gcactccatc 60 
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ctgggtgaga 
aatacagcaa 
gagaaaggcg 
agaaagcagg 
gggaggcggg 
agaggcgcga 
gcccgagcct 
ctctccctgc 
acgttgacat 
ctcactgtgc 
caggcgagaa 
tggacattgg 
gcgctgggga 
agagccgcag 
gggcagccgc 
aggactcccc 
cagaggcccc 
aggatgagag 
tgatcacagc 
tggcattcgt 
accagctcta 
acacacacac 
gtctcccaaa 
agagatctgg 
gcaagaggca 
tctgagccac 
cttgtctctc 



gagcaagacc 
aaagatttgc 
aatttcttcc 
gttcggacgt 
gctggaggaa 
ctgggcttgc 
cgcgcgcttg 
ccctcggacc 
gccggagatc 
agaaaatctt 
gaccatctct 
gaatgatggc 
gcaagactat 
tggctcaaac 
cgagaagcgc 
ctttggcttg 
gtcccagaag 
cgccaactct 
acaggaatgg 
tcgtccccga 
tggggtggtg 
acacacacac 
actctaagag 
tacacaagga 
gggaactggg 
acctgtatct 
tcctgaagcc 



ctgtctcaac 
tttctcggct 
agacaccaat 
cattgggagg 
acgctcgttg 
gcagtgtcga 
cgcactttag 
ccatactcta 
cgcctccgcc 
ctcaaggcag 
gtggtcctac 
tcagctttcg 
gaggtccttc 
cccaaccgcg 
tgggaccggg 
agttttgtac 
gtgacagtga 
ctgaggctcg 
igatcccagct 
tggatctaca 
ccgcaagcct 
acgatgcatt 
gctccctggg 
actctttgtc 
aatctgaccc 
tggctgtccc 
tea 



aaaaaatttt 
tcagtgtggg 
cccggaggtc 
cgaggctaga 
ctaaggaacg 
cgccggcgcc 
ccagcgcagg 
cctcatcctt 
atgtcgtgtc 
acacttaccg 
agttggagaa 
tggaggtgct 
tggtcacctc 
ttcgcatgtt 
tcaaaattgt 
ggtttcatag 
ccaagcttgg 
aggactatat 
ttgaggaggc 
gttgcaatga 
gaagtatgtg 
taataaagat 
acctggggaa 
tcttctgctt 
tcagcactgc 
tttgtggctg 



taaaaaataa 
cggtaactcc 
gcttctgttg 
gcggggttgt 
cagcgctctt 
ggcgcgccgg 
gcgcaccccg 
ctggccaggc 
ctgcagcagc 
aaaatggcgg 
ggaggagcag 
ggtgggcagt 
atctttcatg 
tgggcctgac 
ttgcagccag 
ccccccagac 
ccagttccgt 
gagtgaccgg 
cctgatggac 
gaagcagaag 
ctatacacac 
gagttggttc 
gaatgctggg 
ggccccttat 
ccctcaactt 
gannntgggt 



aataaataat 
atcgtgcaat 
ctaggctccc 
gtgtggcgga 
cccgctctgg 
ggtttgaaag 
ctccctccca 
gaagcccacg 
caggactcga 
gcagccaagg 
atacacagtg 
tcagctggag 
tccccttccg 
aagctggtcc 
ccctacagca 
aaagatgagg 
gtgaaggagg 
gttcagtttg 
aacccctccc 
ttacttcctc 
acacacacac 
tcatccaaga 
cacctccgtc 
ccctgtgttg 
tttctggccc 
acccatgagg 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1643 



<210> 27 

<211> 2713 

<212> DNA 

<213> Homo sapiens 



<400> 27 

tcccttggcc 

ctgggtgaga 

aatacagcaa 

gagaaaggcg 

agaaagcagg 

gggaggcggg 

agaggcgcga 

gcccgagcct 

ctctccctgc 

acgttgacat 

ctcactgtgc 

caggcgagaa 

tggacattgg 

gcgctgggga 

agagccgcag 

gggcagccgc 

aggactcccc 

cagaggcccc 

aggatgagag 

catccccagt 

cttctagtgc 

cccaggagtc 

ccagcaaacc 

caactcgtac 

aaccccgagg 

agatcct tea 



ccaggagaca 
gagcaagacc 
aaagatttgc 
aatttcttcc 
gttcggacgt 
gctggaggaa 
ctgggcttgc 
cgcgcgcttg 
ccctcggacc 
gccggagatc 
agaaaatctt 
gaccatctct 
gaatgatggc 
gcaagactat 
tggctcaaac 
cgagaagcgc 
ctttggcttg 
gtcccagaag 
cgccaactct 
cacagccagc 
tgcctcctca 
tcccaaaggg 
accagcccag 
cccagccaca 
agaaggcaec 
gggtgtggta 



ggggttgcag 
ctgtctcaac 
tttctcggct 
agacaccaat 
cattgggagg 
acgctcgttg 
gcagtgtcga 
cgcactttag 
ccatactcta 
cgcctccgcc 
ctcaaggcag 
gtggtcctac 
tcagctttcg 
gaggtccttc 
cccaaccgcg 
tgggaccggg 
agttttgtac 
gtgacagtga 
ctgaggccgg 
gacccggcag 
gcctctccag 
aagaggaagt 
ctgtcgccat 
gccceagtcc 
gagcccagac 
gtggtgctga 



aaagccgaga 
aaaaaatttt 
tcagtgtggg 
cccggaggtc 
cgaggctaga 
ctaaggaacg 
cgccggcgcc 
ccagcgcagg 
cctcatcctt 
atgtcgtgtc 
acacttaccg 
agttggagaa 
tggaggtgct 
tggtcacctc 
ttcgcatgtt 
tcaaaattgt 
ggtttcatag 
ccaagcttgg 
gggctctctt 
gacctagcta 
tctccagggc 
tggatttgaa 
ctgttcecaa 
ctgcccgagc 
gaccccgagc 
gtggcttcca 



tcgtgccact 
taaaaaataa 
cggtaactcc 
gcttctgttg 
gcggggttgt 
cagcgctctt 
ggcgcgccgg 
gcgcaccccg 
ctggccaggc 
ctgcagcagc 
aaaatggcgg 
ggaggagcag 
ggtgggcagt 
atctttcatg 
tgggcctgac 
ttgcagccag 
ccccccagac 
ccagttccgt 
cttcagccgg 
tgcagctgct 
cataggcagc 
ccaagaagaa 
gagacctaaa 
acagggggca 
tggcccagag 
gaaccccttc 



gcactccatc 
aataaataat 
atcgtgcaat 
ctaggctccc 
gtgtggcgga 
cccgctctgg 
ggtttgaaag 
ctccctccca 
gaagcccacg 
caggactcga 
gcagccaagg 
atacacagtg 
tcagctggag 
tccccttccg 
aagctggtcc 
ccctacagca 
aaagatgagg 
gtgaaggagg 
atcaacaaga 
accctccagg 
acctccaagc 
aagaagaccc 
ttgccagctc 
gtgacaggca 
gagctgggga 
cgctccgagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1-140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 
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tgcgagataa ggccctagag cttggggcca agtatcggcc agactggacc cgggacagca 1620 

cgcacctcat ctgtgcc.ttt gccaacaccc ccaagtacag ccaggtccta ggcctgggag 1680 

gccgcatcgt gcgtaaggag tgggtgctgg actgtcaccg catgcgtcgg cggctgccct 1740 

cccggaggta cctcatggca gggccaggtt ccagcagtga ggaggatgag gcctctcaca 1800 

gcggtggcag cggagatgaa gcccccaagc ttcctcagaa gcaaccccag accaaaacca 1860 

agcccactca ggcagctgga cccagctcac cccagaagcc cccaacccct gaagagacca 1920 

aagcagcctc accagtgctc caggaagata tagacattga gggggtacag tcagaaggac 1980 

aggacaatgg ggcggaagat tctggggaca cagaggatga gctgaggagg gtggcagagc 204 0 

agaaggaaca cagactgccc cctggccagg aggagaatgg ggaagacccg tatgcaggct 2100 

ccacggatga gaacacggac agtgaggaac accaggagcc tcctgatctg ccagtccctg 2160 

agctccccag atttcttcca gggcaagcac ttctttcttt acggggagtt ccctggggac 2220 

gagcggcgga aactcatccg atacgtcaca gccttcaatg gggagctccc tggcattcgt 2280 

tcgtccccga tggatctaca gttgcaatga gaagcagaag ttacttcctc accagctcta 2340 

tggggtggtg ccgcaagcct gaagtatgtg ctatacacac acacacacac acacacacac 2400 

acacacacac acgatgcatt taataaagat gagttggttc tcatccaaga gtctcccaaa 2460 

actctaagag gctccctggg acctggggaa gaatgctggg cacctccgtc agagatctgg 2520 

tacacaagga actctttgtc tcttctgctt ggccccttat ccctgtgttg gcaagaggca 2580 

gggaactggg aatctgaccc tcagcactgc ccctcaactt tttctggccc tctgagccac 2640 

acctgtatct tggctgtccc tttgtggctg gannntgggt acccatgagg cttgtctctc 2700 

tcctgaagcc tea 2713 



<210> 28 

<211> 2143 

<212> DNA 

<213> Homo sapiens 



<400> 28 

cggagtgccg 

cagggaccgt 

cgctggggac 

cctgaggcct 

gcgagggtcc 

tcagctctct 

ccgagatgga 

gccggactct 

cagtggcctg 

cagtcacctt 

aggagatcac 

tctactgccc 

gtgactacga 

aaagggtaat 

ggtacgcaga 

ctcaggacct 

agctgctgct 

cccccaagat 

gagtttacta 

cttcgtgtta 

agaaggaaag 

gctagaaagc 

gagacagctg 

gcccagaagg 

acggaggagg 

aagatggctg 

caggaagcac 

cccacgtacc 

aacctcattg 

atggagatag 

ctcaatgaac 

gatattctgc 

aagctcacct 



ggtcgcgcct 
cccccaactc 
ccgcgcagcc 
gtgcagcaac 
cgggcctgag 
caagaggaag 
gttcaattgc 
ggggctccga 
gctcaaaatg 
tcacttcttg 
acaacattta 
tcctgaggct 
ccccagtgtt 
aaatctgtat 
gcaccgaggc 
ggagatgtac 
tggagtggat 
ctccttcccg 
ttaaaccact 
ataagctgat 
ccgattcttt 
agatgaaaga 
acctgttggc 
ccgcagaggc 
agaagcgcct 
aggagtcaga 
gcgaggcgga 
cgcccatgaa 
gtgacagcct 
agaaagaaaa 
tcaagacaga 
acaatgagaa 
tgcagagcgc 



gcaccgaagg 
ccctttccgc 
cagaccgttc 
tccagggggg 
ccccgcgcca 
caacccaaga 
gagatgaagt 
gaaacctggt 
gacaagaagg 
gccaaatttt 
ttcttcttac 
tctgtgctcc 
cacaagcggg 
cagatgactc 
cgagccaggg 
ggtgtgaact 
gccctggggc 
tggaaaaaat 
ggataagaaa 
tctccagcta 
ggaagttcag 
agaagcaaca 
tgaaaaggcc 
tgagcaggaa 
gatggagcag 
gaggagggcc 
gcgaagagcc 
cccaattcca 
gtctttcgac 
agtggaatac 
aatcgaggcc 
ctccgacagg 
caagtcccga 



tcccggctcc 
tcaggcaggg 
ccgggccggc 
ctaaagggct 
tggccggggc 
cgttcaccgt 
ggaaagggaa 
tctttggact 
tactggatca 
atcctgagaa 
aggtaaagaa 
tggcttctta 
gatttttggc 
cggaaatgtg 
atgaagctga 
actttgcaat 
ttcacattta 
gaaatccgaa 
attgatgtct 
tgtatcggga 
cagatgaaag 
atggccaacg 
cagatcaccg 
atgcagcgca 
aaggtgctgg 
aaagaggcag 
aagcagaagc 
gcaccgttgc 
ttcaaagata 
atggaaaaga 
ttgaaactga 
ggtggcagca 
gtggccttct 



tgtgccctcc 
tcctcgcggc 
cagccggcca 
cagagtgcag 
catcgcttcc 
gaggatcgtc 
ggacctcttt 
gcagtacaca 
tgatgtttca 
tgctgaagag 
gcagatttta 
cgccgtccag 
ccaagaggaa 
ggaggagaga 
aatggaatat 
ccggaataaa 
tgaccctgag 
acatctcgta 
tcaagtttaa 
accatgatct 
cccaggccag 
aagcactgat 
aggaggaggc 
tcaaggccac 
aagccgaggt 
atcagctgaa 
tcctggagat 
ctcctgacat 
ctgacatgaa 
gcaagcatct 
aagagaggga 
gcaagcacaa 
ttgaagagct 



ctgcagccgt 
ccatgctggc 
ccatggtggc 
gccgtggggc 
cgcatgagct 
accatggacg 
gatttggtgt 
atcaaggaca 
aaggaagaac 
gagctggttc 
gatgaaaaga 
gccaagtatg 
ttgcttccaa 
attactgctt 
ctgaagatag 
aagggcacag 
aacagactga 
cagtgacaag 
ctcctcaaag 
atttatgagg 
ggaggagaag 
gcggtctgag 
aaaacttctg 
agcgattcgc 
gctggcactg 
gcaggacctg 
tgccaccaag 
accaagcttc 
gcggctttcc 
gcaggagcag 
gacagctctg 
taccattaaa 
ctagcaggtg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 
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acccagccac cccaggacct gccacttctc ctgctaccgg gaccgcggga tggaccagat 2040 
atcaagagag ccatccatag ggagctggcn gggggtttcc gtgggagctc cagaactttc 2100 
cccagctgac atggaaatac aattgtctgc cggcctcccc tea 2143 



<210> 29 

<211> 540 

<212> DNA 

<213> Homo sapiens 

<400> 29 

gcgtagacgc gccgcgtccc cgcytgccgc tgctccgccg cagtcgccgc tccagtctat 60 
ccggcactag gaacagcccc gagcggcgag acggtccccg ccatgtctgc ggccatgagg 120 
gagaggttcg accggttcct gcacgagaag aactgcatga ctgaccttct ggccaagctc 180 
gaggccaaaa ccggcgtgaa caggagcttc atcgctcttg gtgtcatcgg actggtggcc 24 0 
ttgtacctgg tgttcggtta tggagcctct ctcctctgca acctgatagg atttggctac 300 
ccagcctaca tctcaattaa agctatagag agtcccaaca aagaagatga tacccagtgg 360 
ctgacctact gggtagtgta tggtgtgttc agcattgctg aattcttctc tgatatcttc 420 
ctgtcatggt tccccttcta ctacatgctg aagcagattt atttagaacc tccatgtgcc 480 
agattctgtt ctacatctgg gagatacttt ggcgaataaa acaaagtcct tgttgtcaaa 540 



<210> 30 

<211> 4533 

<212> DNA 

<213> Homo sapiens 

<400> 30 

cctactctat tcagatattc tccagattcc taaagattag agatcatttc tcattctcct 60 
aggagtactc acttcaggaa gcaaccagat aaaagagagg tgcaacggaa gccagaacat 120 
tcctcctgga aattcaacct gtttcgcagt ttctcgagga atcagcattc agtcaatccg 18 0 
ggccgggagc agtcatctgt ggtgaggctg attggctggg caggaacagc gccggggcgt 24 0 
gggctgagca cagcgcttcg ctctctttgc cacaggaagc ctgagctcat tcgagtagcg 300 
gctcttccaa gctcaaagaa gcagaggccg ctgttcgttt cctttaggtc tttccactaa 360 
agtcggagta tcttcttcca agatttcacg tcttggtggc cgttccaagg agcgcgaggt 420 
cgggatggat cttgaagggg accgcaatgg aggagcaaag aagaagaact tttttaaact 48 0 
gaacaataaa agtgaaaaag ataagaagga aaagaaacca actgtcagtg tattttcaat 540 
gtttcgctat tcaaattggc ttgacaagtt gtatatggtg gtgggaactt tggctgccat 600 
catccatggg gctggacttc ctctcatgat gctggtgttt ggagaaatga cagatatctt 660 
tgcaaatgca ggaaatttag aagatctgat gtcaaacatc actaatagaa gtgatatcaa 720 
tgatacaggg ttcttcatga atctggagga agacatgacc aggtatgcct attattacag 780 
tggaattggt gctggggtgc tggttgctgc ttacattcag gtttcatttt ggtgcctggc 840 
agctggaaga caaatacaca aaattagaaa acagtttttt catgctataa tgcgacagga 900 
gataggctgg tttgatgtgc acgatgttgg ggagcttaac acccgactta cagatgatgt 960 
ctctaagatt aatgaagtta ttggtgacaa aattggaatg ttctttcagt caatggcaac 1020 
atttttcact gggtttatag taggatttac acgtggttgg aagctaaccc ttgtgatttt 1080 
ggccatcagt cctgttcttg gactgtcagc tgctgtctgg gcaaagatac tatcttcatt 1140 
tactgataaa gaactcttag cgtatgcaaa agctggagca gtagctgaag aggtcttggc 1200 
agcaattaga actgtgattg catttggagg acaaaagaaa gaacttgaaa ggtacaacaa 12 60 
aaatttagaa gaagctaaaa gaattgggat aaagaaagct attacagcca atatttctat 1320 
aggtgctgct ttcctgctga tctatgcatc ttatgctctg gccttctggt atgggaccac 1380 
cttggtcctc tcaggggaat attctattgg acaagtactc actgtattct tttctgtatt 1440 
aattggggct tttagtgttg gacaggcatc tccaagcatt gaagcatttg caaatgcaag 1500 
aggagcagct tatgaaatct tcaagataat tgataataag ccaagtattg acagctattc 1560 
gaagagtggg cacaaaccag ataatattaa gggaaatttg gaattcagaa atgttcactt 1620 
cagttaccca tctcgaaaag aagttaagat cttgaagggc ctgaacctga aggtgcagag 1680 
tgggcagacg gtggccctgg ttggaaacag tggctgtggg aagagcacaa cagtccagct 1740 
gatgcagagg ctctatgacc ccacagaggg gatggtcagt gttgatggac aggatattag 1800 
gaccataaat gtaaggtttc tacgggaaat cattggtgtg gtgagtcagg aacctgtatt 1860 
gtttgccacc acgatagctg aaaacattcg ctatggccgt gaaaatgtca ccatggatga 1920 
gattgagaaa gctgtcaagg aagccaatgc ctatgacttt atcatgaaac tgcctcataa 1980 
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atttgacacc ctggttggag agagaggggc ccagttgagt ggtgggcaga agcagaggat 2040 
cgccattgca cgtgccctgg ttcgcaaccc caagatcctc ctgctggatg aggccacgtc 2100 
agccttggac acagaaagcg aagcagtggt tcaggtggct ctggataagg ccagaaaagg 2160 
tcggaccacc attgtgatag ctcatcgttt gtctacagtt cgtaatgctg acgtcatcgc 2220 
tggtttcgat gatggagtca ttgtggagaa aggaaatcat gatgaactca tgaaagagaa 2280 
aggcatttac ttcaaacttg tcacaatgca gacagcagga aatgaagttg aattagaaaa 2340 
tgcagctgat gaatccaaaa gtgaaattga tgccttggaa atgtcttcaa atgattcaag 2400 
atccagtcta ataagaaaaa gatcaactcg taggagtgtc cgtggatcac aagcccaaga 2460 
cagaaagctt agtaccaaag aggctctgga tgaaagtata cctccagttt ccttttggag 2520 
gattatgaag ctaaatttaa ctgaatggcc ttattttgtt gttggtgtat tttgtgccat 2580 
tataaatgga ggcctgcaac cagcatttgc aataatattt tcaaagatta taggggtttt 2640 
tacaagaatt gatgatcctg aaacaaaacg acagaatagt aacttgtttt cactattgtt 2700 
tctagccctt ggaattattt cttttattac atttttcctt cagggtttca catttggcaa 2760 
agctggagag atcctcacca agcggctccg atacatggtt ttccgatcca tgctcagaca 2820 
ggatgtgagt tggtttgatg accctaaaaa caccactgga gcattgacta ccaggctcgc 2880 
caatgatgct gctcaagtta aaggggctat aggttccagg cttgctgtaa ttacccagaa 2940 
tatagcaaat cttgggacag gaataattat atccttcatc tatggttggc aactaacact 3000 
gttactctta gcaattgtac ccatcattgc aatagcagga gttgttgaaa tgaaaatgtt 3060 
gtctggacaa gcactgaaag ataagaaaga actagaaggt gctgggaaga tcgctactga 3120 
agcaatagaa aacttccgaa ccgttgtttc tttgactcag gagcagaagt ttgaacatat 3180 
gtatgctcag agtttgcagg taccatacag aaactctttg aggaaagcac acatctttgg 3240 
aattacattt tccttcaccc aggcaatgat gtatttttcc tatgctggat gtttccggtt 3300 
tggagcctac ttggtggcac ataaactcat gagctttgag gatgttctgt tagtattttc 3360 
agctgttgtc tttggtgcca tggccgtggg gcaagtcagt tcatttgctc ctgactatgc 3420 
caaagccaaa atatcagcag cccacatcat catgatcatt gaaaaaaccc ctttgattga 3480 
cagctacagc acggaaggcc taatgccgaa cacattggaa ggaaatgtca catttggtga 3540 
agttgtattc aactatccca cccgaccgga catcccagtg cttcagggac tgagcctgga 3600 
ggtgaagaag ggccagacgc tggctctggt gggcagcagt ggctgtggga agagcacagt 3660 
ggtccagctc ctggagcggt tctacgaccc cttggcaggg aaagtgctgc ttgatggcaa 3720 
agaaataaag cgactgaatg ttcagtggct ccgagcacac ctgggcatcg tgtcccagga 3780 
gcccatcctg tttgactgca gcattgctga gaacattgcc tatggagaca acagccgggt 3840 
ggtgtcacag gaagagatcg tgagggcagc aaaggaggcc aacatacatg ccttcatcga 3900 
gtcactgcct aataaatata gcactaaagt aggagacaaa ggaactcagc tctctggtgg 3960 
ccagaaacaa cgcattgcca tagctcgtgc ccttgttaga cagcctcata ttttgctttt 4020 
ggatgaagcc acgtcagctc tggatacaga aagtgaaaag gttgtccaag aagccctgga 4080 
caaagccaga gaaggccgca cctgcattgt gattgctcac cgcctgtcca ccatccagaa 4140 
tgcagactta atagtggtgt ttcagaatgg cagagtcaag gagcatggca cgcatcagca 4200 
gctgctggca cagaaaggca tctatttttc aatggtcagt gtccaggctg gaacaattta 42 60 
gaagacccaa tttggggttt attacggcat ttgaaatttg agagttagaa gtgacacctg 4320 
tcttgatgaa gtattgaaac aacaggctag ctctccctaa gttattagaa acatccagaa 4380 
aaagaacaaa attcaaggta agtacaggtg tagaaatata gaagccacag acatagggaa 4440 
atggtcttat ggtcctgaat gggttgggta gacaggaaag tttgggaata attttcccaa 4500 
gtacaaaaga aaaataaatt gttcaatagc etc 4533 



<210> 31 

<211> 2307 

<212> DNA 

<213> Homo sapiens 

<400> 31 

cctactctat tcagatattc tccagattcc taaagattag agatcatttc tcattctcct 60 
aggagtactc . acttcaggaa gcaaccagat aaaagagagg tgcaacggaa gccagaacat 120 
tcctcctgga aattcaacct gtttcgcagt ttctcgagga atcagcattc agtcaatccg 180 
ggccgggagc agtcatctgt ggtgaggctg attggctggg caggaacagc gccggggcgt 240 
gggctgagca cagcgcttcg ctctctttgc cacaggaagc ctgagctcat tcgagtagcg 300 
gctcttccaa gctcaaagaa gcagaggccg ctgttcgttt cctttaggtc tttccactaa 360 
agtcggagta tcttcttcca agatttcacg tcttggtggc cgttccaagg agcgcgaggt 420 
cgggatggat cttgaagggg accgcaatgg aggagcaaag aagaagaact tttttaaact 480 
gaacaataaa agtgaaaaag ataagaagga aaagaaacca actgtcagtg tattttcaat 540 
gtttcgctat tcaaattggc ttgacaagtt gtatatggtg gtgggaactt tggctgccat 600 
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catccatggg 
tgcaaatgca 
tgatacaggg 
tggaattggt 
agctggaaga 
gataggctgg 
ctccaagatt 
atttttcact 
ggccatcagt 
tactgataaa 
agcaattaga 
aaatttagaa 
aggtgctgct 
cttggtcctc 
aattggggct 
aggagcagct 
gaagagtggg 
cagttaccca 
tgggcagacg 
gatgcagagg 
gaccataaat 
gtttgccacc 
gattgagaaa 
atttgacacc 
cgccattgca 
agcattggac 
ctaaaaagct 
agtctgggct 
tctcctttct 



gctggacttc 
ggaaatttag 
ttcttcatga 
gctggggtgc 
caaatacaca 
tttgatgtgc 
aatgaaggaa 
gggtttatag 
cctgttcttg 
gaactcttag 
actgtgattg 
gaagctaaaa 
ttcctgctga 
tcaggggaat 
tttagtgttg 
tatgaaatct 
cacaaaccag 
tctcgaaaag 
gtggccctgg 
ctctatgacc 
gtaaggtttc 
acgatagctg 
gctgtcaagg 
ctggttggag 
cgtgccctgg 
acagaaagtg 
gaaggaccac 
gagaaacaga 
gtcaggttac 



ctctcatgat 
aagatctgat 
atctggagga 
tggttgctgc 
aaattagaaa 
acgatgttgg 
ttggtgacaa 
taggatttac 
gactgtcagc 
cgtatgcaaa 
catttggagg 
gaattgggat 
tctatgcatc 
attctattgg 
gacaggcatc 
tcaagataat 
ataatattaa 
aagttaagat 
ttggaaacag 
ccacagaggg 
tacgggaaat 
aaaacattcg 
aagccaatgc 
agagaggggc 
ttcgcaaccc 
aagctgaggt 
cacattgaaa 
aacatagcaa 
attgtcc 



gctggtgttt 
gtcaaacatc 
agacatgacc 
ttacattcag 
acagtttttt 
ggagcttaac 
aattggaatg 
acgtggttgg 
tgctgtctgg 
agctggagca 
acaaaagaaa 
aaagaaagct 
ttatgctctg 
acaagtactc 
tccaagcatt 
tgataataag 
gggaaatttg 
cttgaagggc 
tggctgtggg 
gatggtcagt 
cattggtgtg 
ctatggccgt 
ctatgacttt 
ccagctgagt 
caagatcctt 
acaggcagct 
cctattgaag 
atggagctac 



ggagaaatga 
actaatagaa 
aggtatgcct 
gtttcatttt 
catgctataa 
acccgactta 
ttctttcagt 
aagctaaccc 
gcaaagatac 
gtagctgaag 
gaacttgaaa 
attacagcca 
gccttctggt 
actgtattyt 
gaagcatttg 
ccaagtattg 
gaattcagaa 
ctgaacctga 
aagagcacaa 
gttgatggac 
gtgagtcagg 
gaaaatgtca 
atcatgaaac 
ggtgggcaga 
ctgctggatg 
ctggataagg 
attcttgcca 
ctcatggagc 



cagatatctt 
gtgatatcaa 
attattacag 
ggtgcctggc 
tgcgacagga 
cagatgatgt 
caatggcaac 
ttgtgatttt 
tatcttcatt 
aggtcttggc 
ggtacaacaa 
atatttctat 
atgggaccac 
tttctgtatt 
caaatgcaag 
acagctattc 
atgttcactt 
aggtgcagag 
cagtccagct 
aggatattag 
aacctgtatt 
ccatggatga 
tgcctcataa 
agcagaggat 
aggccacgtc 
tcagtagact 
gtgcttccgg 
tgtattgatt 



660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2307 



<210> 32 

<211> 1712 

<212> DNA 

<213> Homo sapiens 

<400> 32 

gcttggattg cggagccgcg agcagcgctg ggtaacggcc gcggcgacca ccccggacgg 60 
Gccctgtccc cgctggcggg cttccctgtc gccgttcgct gcgctgccgg cttcttggtg 120 
aatttttgga tgaagccatt aaattaattg cttgccatca tgagcagaag caagcgtgac 180 
aacaattttt atagtgtaga gattggagat tctacattca cagtcctgaa acgatatcag 240 
aatttaaaac ctataggctc aggagctcaa ggaatagtat gcgcagctta tgatgccatt 300 
cttgaaagaa atgttgcaat caagaagcta agccgaccat ttcagaatca gactcatgcc 360 
aagcgggcct acagagagct agttcttatg aaatgtgtta atcacaaaaa tataattggc 420 
cttttgaatg ttttcacacc acagaaatcc ctagaagaat ttcaagatgt ttacatagtc 480 
atggagctca tggatgcaaa tctttgccaa gtgattcaga tggagctaga tcatgaaaga 54 0 
atgtcctacc ttctctatca gatgctgtgt ggaatcaagc accttcattc tgctggaatt 600 
attcatcggg acttaaagcc cagtaatata gtagtaaaat ctgattgcac tttgaagatt 660 
cttgacttcg gtctggccag gactgcagga acgagtttta tgatgacgcc ttatgtagtg 720 
actcgctact acagagcacc cgaggtcatc cttggcatgg gctacaagga aaacggagga 780 
agaatgggaa aaggcatatt cacaaggtta caataaggtg cctgtgagat ataaaattta 840 
taactgccac atcctttctt aggaattttt aaatttctat tttcttgtaa tatgaataca 900 
agaatacatt cttgtaaatg aatgtattga acattagtta tggagtattt ttcttagcta 960 
cttgatatta gatattgatc agtggaataa agttattgaa cagcttggaa caccatgtcc 1020 
tgaattcatg aagaaactgc aaccaacagt aaggacttac gttgaaaaca gacctaaata 1080 
tgctggatat agctttgaga aactcttccc tgatgtcctt ttcccagctg actcagaaca 1140 
caacaaactt aaagccagtc aggcaaggga tttgttatcc aaaatgctgg taatagatgc 1200 
atctaaaagg atctctgtag atgaagctct ccaacacccg tacatcaatg tctggtatga 1260 
tccttctgaa gcagaagctc caccaccaaa gatccctgac aagcagttag atgaaaggga 1320 
acacacaata gaagagtgga aagaattgat atataaggaa gttatggact tggaggagag 1380 
aaccaagaat ggagttatac gggggcagcc ctctccttta gcacaggtgc agcagtgatc 1440 
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1 



aatggctctc agcatccatc atcatcgtcg 
gatccgactt tggcctctga tacagacagc 
tgctgtagat gactacttgg gccatcgggg 
tgatagaact actttgaaaa caattcagtg 
gtaggatttc attttgtagt aaagtagttt 



tctgtcaatg atgtgtcttc aatgtcaaca 1500 

agtctagaag cagcagctgg gcctctgggc 1560 

ggtgggaggg atggggagtc ggttagtcat 1620 

gtcttatttt tgggtgattt ttcaaaaaat 1680 

at 1712 



<210> 33 

<211> 1068 

<212> DNA 

<213> Homo sapiens 

<400> 33 

cattaattgc ttgccatcat gagcagaagc aagcgtgaca acaattttta tagtgtagag 60 

attggagatt ctacattcac agtcctgaaa cgatatcaga atttaaaacc tataggctca 120 

ggagctcaag gaatagtatg cgcagcttat gatgccattc ttgaaagaaa tgttgcaatc 180 

aagaagctaa gccgaccatt tcagaatcag actcatgcca agcgggccta cagagagcta 240 

gttcttatga aatgtgttaa tcacaaaaat ataattggcc ttttgaatgt tttcacacca 300 

cagaaatccc tagaagaatt tcaagatgtt tacatagtca tggagctcat ggatgcaaat 360 

ctttgccaag tgattcagat ggagctagat catgaaagaa tgtcctacct tctctatcag 420 

atgctgtgtg gaatcaagca ccttcattct gctggaatta ttcatcggga cttaaagccc 4 80 

agtaatatag tagtaaaatc tgattgcact ttgaagattc ttgacttcgg tctggccagg 540 

actgcaggaa cgagttttat gatgacgcct tatgtagtga ctcgctacta cagagcaccc 600 

gaggtcatcc ttggcatggg ctacaaggaa aacgtggatt tatggtctgt ggggtgcatt 660 

atgggagaaa tggtttgcca caaaatcctc tttccaggaa gggactatat tgatcagtgg 720 

aataaagtta ttgaacagct tggaacacca tgtcctgaat tcatgaagaa actgcaacca 780 

acagtaagga cttacgttga aaacagacct aaatatgctg gatatagctt tgagaaactc 840 

ttccctgatg tccttttccc agctgactca gaacacaaca aacttaaagc cagtcagtac 900 

tttttacaaa tatgtacatt taatcccatt tggggtgtgt agtgtgtgtg tnatgggttt 960 

ggtgtttata tgtattcata ttcttatggg acatgaaccc aaggttttct ctggatggtg 1020 

gggaaaaaaa tgaggttttt gttttttttt tctttaatct tatatatt 1068 



<210> 34 

<211> 1388 

<212> DNA 

<213> Homo sapiens 



<400> 34 

cattaattgc ttgccatcat gagcagaagc 
attggagatt ctacattcac agtcctgaaa 
ggagctcaag gaatagtatg cgcagcttat 
aagaagctaa gccgaccatt tcagaatcag 
gttcttatga aatgtgttaa tcacaaaaat 
cagaaatccc tagaagaatt tcaagatgtt 
ctttgccaag tgattcagat ggagctagat 
atgctgtgtg gaatcaagca ccttcattct 
agtaatatag tagtaaaatc tgattgcact 
actgcaggaa cgagttttat gatgacgcct 
gaggtcatcc ttggcatggg ctacaaggaa 
atgggagaaa tggtttgcca caaaatcctc 
aataaagtta ttgaacagct tggaacacca 
acagtaagga cttacgttga aaacagacct 
ttccctgatg tccttttccc agctgactca 
agggatttgt tatccaaaat gctggtaata 
gctctccaac acccgtacat caatgtctgg 
tgtaagttat tttcttaatg tttacagaac 
taggtttgga ggagaccttt taattttaaa 
ttctgtttct tcatgaagac tacgtcaaat 
cacaaaataa agttacctcc cactgttttt 
actaggatgt tgaatgctct gggggaacat 



aagcgtgaca acaattttta tagtgtagag 60 
cgatatcaga atttaaaacc tataggctca 120 
gatgccattc ttgaaagaaa tgttgcaatc 180 
actcatgcca agcgggccta cagagagcta 240 
ataattggcc ttttgaatgt tttcacacca 300 
tacatagtca tggagctcat ggatgcaaat 360 
catgaaagaa tgtcctacct tctctatcag 420 
gctggaatta ttcatcggga cttaaagccc 480 
ttgaagattc ttgacttcgg tctggccagg 540 
tatgtagtga ctcgctacta cagagcaccc 600 
aacgtggatt tatggtctgt ggggtgcatt 660 
tttccaggaa gggactatat tgatcagtgg 720 
tgtcctgaat tcatgaagaa actgcaacca 780 
aaatatgctg gatatagctt tgagaaactc 840 
gaacacaaca aacttaaagc cagtcaggca 900 
gatgcatcta aaaggatctc tgtagatgaa 960 
tatgatcctt ctgaagcaga agctagaagc 1020 
atattgcatt cttagagtta gaatgacagt 1080 
taaaaatgta gatacatgat gatgatgttt 1140 
aaactaatga acatattcga gcccctccta 1200 
tgcaatcttg cctggatacc taaccagaga 1260 
cctaactcag gtataaaaca aattactgta 1320 
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tccaaaggaa aacagaattc tgtgatctgt gatataaata aaatgtggca 
ctagaaga 



atttcaagag 



1380 
1388 



<210> 35 
<211> 1452 
<212> DNA 

<213> Homo sapiens 
<400> 35 

gcccgcagtg gcttttaccg 
taccgggacc tgcagcccgt 
ggccgcaccg gcgctaaggt 
ttcgccaagc gcgcctaccg 
atcgggctgc tggacgtatt 
ctggtgatgc cgttcatggg 
gaggaccgga tccagttcct 
gccggcatca tccacagagt 
gcaatccccc cgcctccacg 
acgccctatg cacagcccct 
aacctggctg tgaacgaaga 
gcagacagtg agatgactgg 
ttgaattgga tgcgctacac 
gagatgatca caggcaagac 
atcatgaag.g tgacngggga 
ggccaagaac tacatgaagg 
gaccaatgca agccctctgg 
gcagcgggtg acggcaggcg 
ggaagatgag ccccaggtcc 
ggatgaatgg aagcgtgtta 
tgggggccag ggtctncaan 
cagtgaggac caccttcacc 
gctggggctt gcattccaag 
tcccacgggc atgcctntgt 
anctctggnc ca 



ccaggaggtg accaagacgg 
gggctcgggc gcctacggcg 
ggccatcaag aagctgtatc 
cgagctgcgc ctgctcaagc 
cactcctgat gagaccctgg 
caccgacctg ggcaagctca 
cgtgtaccag atgctgaagg 
gagtcccggt ggagaagccg 
tcccacctgt gaggatgtga 
ggtgggaacc tgcctgggtg 
ctgtgagctg aagatcctgg 
gtacgtggtg acccggtggt 
gcagacggtg gacatctggt 
gctgttcaag ggcagcgacc 
cgcctccggc tgagtttgtg 
gcctccccga attggagaag 
ctgtgaacct cctggagaag 
aggcgctggc ccatccctac 
agaagtatga tgactccttt 
cttacaaaga ggtgctcagc 
gagacgcctn tgtgaagatc 
ttccacctga gaggggactc 
gcatccattc agacagacgc 
cttgggcgcc catatggang 



cctgggaggt gcgcgccgtg 60 

cggtgtgctc ggccgtggac 120 

ggcccttcca gtccgagctg 180 

acatgcgcca cgagaacgtg 240 

atgacttcac ggacttttac 300 

tgaaacatga gaagctaggc 360 

ggctgaggta tatccacgct 420 

ctcatcagcc ctcccccagt 480 

tggggtctgg gtgctgagcc 54 0 

cccaggacct gaagcccggc 600 

acttcggcct ggccaggcag 660 

accgggctcc cgaggtcatc 720 

ctgtgggctg catcatggcg 780 

acctggacca gctgaaggag 84 0 

cagcggctgc agagcgatga 900 

aaggattttg cctctatcct 960 

atgctggtgc tggacgcgga 1020 

ttcgagtccc tgcacgacac 1080 

gacgacgttg accgcacact 1140 

ttcaagcctn cccggcaggc 1200 

tcttgggctt ccggggtggg 12 60 

tcgntggcac cttgaccttg 1320 

ccgggttcct ggaccctnct 138 0 

agcnctgact ttctggacaa 144 0 

1452 



<210> 36 

<211> 2355 

<212> DNA 

<213> Homo sapiens 

<400> 36 

gaattccgcc agccccgcca gtccccgcgc 

agcagcggcg ccgtgctcgc tccagggcgc 

gtccaacttt gactgcgggt cctgccagtc 

tgctgtgctc gtcaaagagt atgtcgaatc 

gcctaccatg tacccaccct gggacagcac 

catgcagatc attgtgaaag gcaaaaacgt 

ctactcgctg gctgagaggt gcaggaagaa 

gaaacctcaa ggccgaatgc taatgaatgc 

ggacatgaat gaatttgaga cggaaggctt 

caagcaggca aaggtccacc acgtcaagtg 

gcccacattt tgctctgtct gccacgagtt 

gtgccgacaa tgcaatgcag caattcacaa 

cacaggatca gctatcaata gccgagaaac 

catgccacac agatttaaag tctacaatta 

gaccctgctg tggggactgg cacggcaagg 

gcatcataga tgccagacaa aggtggccaa 

tgaagcgctg gccatgattg agagcactca 

gatcttcaga gaaggtccgg ttgaaattgg 



agtccccgcg cagtcccagc gccaccgggc 60 
aaccatgtcg ccatttcttc ggattggctt 120 
ttgtcagggc gaggctgtta acccttactg 180 
agagaacggg cagatgtata tccagaaaaa 240 
ttttgatgcc catatcaaca agggaagagt 300 
ggacctcatc tctgaaacca ccgtggagct 360 
caacgggaag acagaaatat ggttagagct 420 
aagatacttt ctggaaatga gtgacacaaa 480 
ctttgctttg catcagcgcc ggggtgccat 540 
ccacgagttc actgccacct tcttcccaca 600 
tgtctggggc ctgaacaaac agggctacca 660 
gaagtgtatt gataaagtta tagcaaagtg 720 
catgttccac aaggagagat tcaaaattga 780 
caagagcccg accttctgtg aacactgtgg 840 
actcaagtgt gatgcatgtg gcatgaatgt 900 
cctttgtggc ataaaccaga agctaatggc 960 
acaggctcgc tgcttaagag atactgaaca 1020 
tctcccatgc tccatcaaaa atgaagcaag 1080 
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gccgccatgt 
gttggatgag 
atctctgcag 
aagttttggc 
ggccttaaag 
gagagttctt 
gaccaaggaa 
catccaaagc 
tcttggtctg 
catcctgtta 
catgttagga 
gatcttgctg 
ttatgaaatg 
ctccatccgc 
tctggtgaag 
ggtcagtttt 
aactgggttt 
cattaaatgt 
ccccaggaac 
aaacaagcac 
gaggattgaa 
aaacaataaa 



ttaccgacac 
gtggataaaa 
attaaactaa 
aaggtcttcc 
aaagatgtgg 
tccttggcct 
aacctctttt 
tgccacaagt 
cagttccttc 
gacaaagatg 
gatgccaaga 
ggtcagaaat 
ctgattggtc 
atggacaatc 
gtaagaagcg 
ctgttcctct 
aaatgggatg 
caaaatgagt 
cagttccatg 
agtgtgtggc 
gttggaagtg 
gggga 



cgggaaaaag 
tgtgccatct 
aaattgagga 
tggcagaatt 
tcttgatgga 
gggagcatcc 
ttgtgatgga 
tcgacctttc 
attccaaagg 
gacatatcaa 
cgaatacctt 
acaaccactc 
agtcgccttt 
ccttttaccc 
aagccaagag 
tagtttccaa 
tgcaccgtct 
gtaagagaaa 
gaccttgaat 
ttaaaaatca 
tgaggtcagc 



agagcctcag 
tccagaacct 
ttttatcttg 
caagaaaacc 
cgatgatgtt 
gtttctgacg 
gtacctcaac 
cagagcgacg 
aatagtctac 
gatcgcggat 
ctgtgggaca 
tgtggactgg 
ccacgggcag 
acggtggctg 
cgtcttcata 
cttgcatgtg 
gtgttttaat 
ccctcatgtg 
acgctcacct 
atcttctagc 
ctgggcaatg 



ggcatttcct 
gaactgaaca 
cacaaaatgt 
aatcaatttt 
gagtgcacga 
cacatgtttt 
ggaggggact 
ttttatgctg 
agggacctga 
tttggaatgt 
cctgactaca 
tggtccttcg 
gatgaggagg 
gagaaggaag 
agacgagcat 
gcaaacaatg 
agaggcacca 
catcagttat 
ggagatgtag 
tacttgggag 
tatcgagacc 



gggagtctcc 
aagaaagacc 
tggggaaagg 
tcgcaataaa 
tggtagagaa 
gtacattcca 
taatgtacca 
ctgaaatcat 
agctagataa 
gcaaggagaa 
tcgccccaga 
gggttctcct 
agctcttcca 
caaaggacct 
taggtcttct 
attattattg 
atattatgag 
aacataacgg 
ttggttcatt 
gttgaggcag 
cctgtctcca 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2355 



<210> 37 
<211> 497 
<212> PRT 

<213> Homo sapiens 
<400> 37 

Met Val Arg Ser Gly Asn Lys Ala Ala Val Val Leu Cys Met Asp Val 
15 10 15 

Gly Phe Thr Met Ser Asn Ser lie Pro Gly lie Glu Ser Pro Phe Glu 
20 25 30 

Gin Ala Lys Lys Val lie Thr Met Phe Val Gin Arg Gin Val Phe Ala 
35 40 45 

Glu Asn Lys Asp Glu lie Ala Leu Val Leu Phe Gly Thr Asp Gly Thr 
50 55 60 

Asp Asn Pro Leu Ser Gly Gly Asp Gin Tyr Gin Asn lie Thr Val His 
65 70 75 80 

Arg His Leu Met Leu Pro Asp Phe Asp Leu Leu Glu Asp lie Glu Ser 
85 90 95 

Lys lie Gin Pro Gly Ser Gin Gin Ala Asp Phe Leu Asp Ala Leu lie 
100 105 110 

Val Ser Met Asp Val lie Gin His Glu Thr lie Gly Lys Lys Phe Glu 
115 120 125 

Lys Arg His lie Glu lie Phe Thr Asp Leu Ser Ser Arg Phe Ser Lys 
130 135 140 

Ser Gin Leu Asp lie lie lie His Ser Leu Lys Lys Cys Asp lie Ser 
145 150 155 160 

Leu Gin Phe Phe Leu Pro Phe Ser Leu Gly Lys Glu Asp Gly Ser Gly 
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165 



170 



175 



Asp Arg Gly Asp 
180 

Pro Leu Lys Gly 
195 

Lys Met Val Met 
210 

Tyr Ser Phe Ser 
225 

Glu Arg His Ser 



Leu Ser lie Arg 
260 

Lys Lys Thr Trp 
275 

lie Gin Lys Glu 
290 

Val Leu Lys Glu 
305 

Val Pro Phe Ser 



Gly Lys Cys Phe 
340 

Arg Arg Phe Phe 
355 

Asp Asp Glu Ala 
370 

Asp Asp Leu Asp 
385 

Ala Asn Pro Gin 



Glu Cys Leu Val 
420 

Tyr Met Phe Ser 
435 

Ala Gin Leu Asn 
450 

Lys Lys Asp Glu 
465 

Lys lie Pro Asn 



Gly Pro Phe Arg 



lie Thr Glu Gin 
200 

lie Ser Leu Glu 
215 

Glu Ser Leu Arg 
230 

lie His Trp Pro 
245 

lie Ala Ala Tyr 



Thr Val Val Asp 
280 

Thr Val Tyr Cys 
295 

Asp lie lie Gin 
310 

Lys Val Asp Glu 
325 

Ser Val Leu Gly 



Met Gly Asn Gin 
360 

Ala Ala Val Ala 
375 

Met Val Ala lie 
390 

Val Gly Val Ala 
405 

Tyr Val Gin Leu 



Ser Leu Lys Asn 
440 

Ala Val Asp Ala 
455 

Lys Thr Asp Thr 
470 

Pro Arg Phe Gin 
485 



Leu Gly Gly His 
185 

Gin Lys Glu Gly 



Gly Glu Asp Gly 
220 

Lys Leu Cys Val 
235 

Cys Arg Leu Thr 
250 

Lys Ser lie Leu 
265 

Ala Lys Thr Leu 



Leu Asn Asp Asp 
300 

Gly Phe Arg Tyr 
315 

Glu Gin Met Lys 
330 

Phe Cys Lys Ser 
345 

Val Leu Lys Val 



Leu Ser ■ Ser Leu 
380 

Val Arg Tyr Ala 
395 

Phe Pro His lie 
410 

Pro Phe Met Glu 
425 

Ser Lys Lys Tyr 



Leu lie Asp Ser 
460 

Leu Glu Asp Leu 
475 

Arg Leu Phe Gin 
490 



Gly Pro Ser Phe 
190 

Leu Glu lie Val 
205 

Leu Asp Glu lie 



Phe Lys Lys lie 
240 

lie Gly Ser Asn 
255 

Gin Glu Arg Val 
270 

Lys Lys Glu Asp 
285 

Asp Glu Thr Glu 



Gly Ser Asp lie 
320 

Tyr Lys Ser Glu 
335 

Ser Gin Val Gin 
350 

Phe Ala Ala Arg 
365 

lie His Ala Leu 



Tyr Asp Lys Arg 
400 

Lys His Asn Tyr 
415 

Asp Leu Arg Gin 
430 

Ala Pro Thr Glu 
445 

Met Ser Leu Ala 



Phe Pro Thr Thr 
480 

Val Arg Glu Glu 
495 
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Gly 



<210> 38 
<211> 521 
<212> PRT 

<213> Homo sapiens 
<400> 38 

Met Val Arg Ser Gly Asn Lys Ala Ala Val Val Leu Cys Met Asp Val 
15 10 15 

Gly Phe Thr Met Ser Asn Ser lie Pro Gly lie Glu Ser Pro Phe Glu 
20 25 30 

Gin Ala Lys Lys Val lie Thr Met Phe Val Gin Arg Gin Val Phe Ala 
35 40 45 

Glu Asn Lys Asp Glu lie Ala Leu Val Leu Phe Gly Thr Asp Gly Thr 
50 55 60 

Asp Asn Pro Leu Ser Gly Gly Asp Gin Tyr Gin Asn lie Thr Val His 
65 70 75 80 

Arg His Leu Met Leu Pro Asp Phe Asp Leu Leu Glu Asp lie Glu Ser 
85 90 95 

Lys lie Gin Pro Gly Ser Gin Gin Ala Asp Phe Leu Asp Ala Leu lie 
100 105 110 

Val Ser Met Asp Val lie Gin His Glu Thr lie Gly Lys Lys Phe Glu 
115 120 125 

Lys Arg His lie Glu lie Phe Thr Asp Leu Ser Ser Arg Phe Ser Lys 
130 135 140 

Ser Gin Leu Asp lie lie lie His Ser Leu Lys Lys Cys Asp lie Ser 
145 150 155 160 

Leu Gin Phe Phe Leu Pro Phe Ser Leu Gly Lys Glu Asp Gly Ser Gly 
165 170 175 

Asp Arg Gly Asp Gly Pro Phe Arg Leu Gly Gly His Gly Pro Ser Phe 
180 185 190 

Pro Leu Lys Gly lie Thr Glu Gin Gin Lys Glu Gly Leu Glu lie Val 
195 200 205 

Lys Met Val Met lie Ser Leu Glu Gly Glu Asp Gly Leu Asp Glu lie 
210 215 220 

Tyr Ser Phe Ser Glu Ser Leu Arg Lys Leu Cys Val Phe Lys Lys lie 
225 230 235 240 

Glu Arg His Ser lie His Trp Pro Cys Arg Leu Thr lie Gly Ser Asn 
245 250 255 

Leu Ser lie Arg lie Ala Ala Tyr Lys Ser lie Leu Gin Glu Arg Val 
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260 

Lys Lys Thr Trp 
275 

lie Gin Lys Glu 
290 

Leu Asn Pro Pro 
305 

Lys lie Lys Thr 



Val Thr Ala Gin 
340 

Ala Lys Lys Leu 
355 

Ser Leu Ala Glu 
370 



Thr Val Val Asp 
280 

Thr Vai Tyr Cys 
295 

Ala Glu Val Thr 
310 

Leu Phe Pro Leu 
325 

Glu He Phe Gin 



Lys Thr Glu Gin 
360 

Gly Ser Val Thr 
375 



265 

Ala Lys Thr Leu 



Leu Asn Asp Asp 
300 

Thr Lys Ser Gin 
315 

He Glu Ala Lys 
330 

Asp Asn His Glu 
345 

Gly Gly Ala His 



Ser Val Gly Ser 
380 



270 

Lys Lys Glu Asp 
285 

Asp Glu Thr Glu 



He Pro Leu Ser 
320 

Lys Lys Asp Gin 
335 

Asp Gly Pro Thr 
350 

Phe Ser Val Ser 
365 

Val Asn Pro Ala 



Glu Asn Phe Arg 
385 

Ala Ser Asn Gin 



Glu Thr Pro Tyr 
420 

Glu Glu Ala He 
435 

Lys Ala Leu Gin 
450 



Val Leu Val Lys 
390 

Leu He Asn His 
405 

Phe Met Lys Ser 



Lys Phe Ser Glu 
440 

Glu Lys Val Glu 
455 



Gin Lys Lys Ala 
395 

He Glu Gin Phe 
410 

He Asp Cys He 
425 

Glu Gin Arg Phe 



He Lys Gin Leu 
4 60 



Ser Phe Glu Glu 
400 

Leu Asp Thr Asn 
415 

Arg Ala Phe Arg 
430 

Asn Asn Phe Leu 
445 

Asn His Phe Trp 



Glu He Val Val Gin Asp Gly He Thr Leu . He Thr Lys Glu Glu Ala 
465 470 475 480 

Ser Gly Ser Ser Val Thr Ala Glu Glu Ala Lys Lys Phe Leu Ala Pro 
485 490 495 

Lys Asp Lys Pro Ser Gly Asp Thr Ala Ala Val Phe Glu Glu Gly Gly 
500 505 510 

Asp Val Asp Asp Leu Leu Asp Met He 
515 520 



<210> 39 
<211> 437 
<212> PRT 

<213> Homo sapiens 
<400> 39 

Met Gly Cys Gly Cys Ser Ser His Pro Glu Asp Asp Trp Met Glu Asn 
15 10 15 
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lie Asp Val Cys 
20 

Lys Gly Thr Leu 
35 

Val Thr Tyr Glu 
50 

Leu Val He Ala 
65 

Gly Phe Glu Lys 



Trp Trp Lys Ala 
100 

Phe Asn Phe Val 
115 

Phe Lys Asn Leu 
130 

Gly Asn Thr His 
145 

Gly Ser Phe Ser 



Val Val Lys His 
180 

He Ser Pro Arg 
195 

Tyr Thr Asn Ala 
210 

Gin Thr Gin Lys 
225 

Pro Arg Glu Thr 



Gly Glu Val Trp 
260 



Lys Ser Leu Lys 
275 

Ala Asn Leu Met 
290 

Ala Val Val Thr 
305 

Asn Gly Ser Leu 



Thr lie Asn Lys 



Glu Asn Cys His 



Leu He Arg Asn 
40 

Gly Ser Asn Pro 
55 

Leu His Ser Tyr 
70 

Gly Glu Gin Leu 
85 

Gin Ser Leu Thr 



Ala Lys Ala Asn 
120 

Ser Arg Lys Asp 
135 

Gly Ser Phe Leu 
150 

Leu Ser Val Arg 
165 

Tyr Lys lie Arg 



He Thr Phe Pro 
200 

Ser Asp Gly Leu 
215 

Pro Gin Lys Pro 
230 

Leu Lys Leu Val 
245 

Met Gly Tyr Tyr 



Gin Gly Ser Met 
280 

Lys Gin Leu Gin 
295 

Gin Glu Pro He 
310 

Val Asp Phe Leu 
325 

Leu Leu Asp Met 



Tyr Pro He Val 
25 

Gly Ser Glu Val 



Pro Ala Ser Pro 
60 

Glu Pro Ser His 
75 

Arg He Leu Glu 
90 

Thr Gly Gin Glu 
105 

Ser Leu Glu Pro 



Ala Glu Arg Gin 
140 

He Arg Glu Ser 
155 

Asp Phe Asp Gin 
170 

Asn Leu Asp Asn 
185 

Gly Leu His Glu 



Cys Thr Arg Leu 
220 

Trp Trp Glu Asp 
235 

Glu Arg Leu Gly 
250 

Asn Gly His Thr 
265 

Ser Pro Asp Ala 



His Gin Arg Leu 
300 

Tyr He He Thr 
315 

Lys Thr Pro Ser 
330 

Ala Ala Gin He 



Pro Leu Asp Gly 
30 

Arg Asp Pro Leu 
45 

Leu Gin Asp Asn 



Asp Gly Asp Leu 
80 

Gin Ser Gly Glu 
95 

Gly Phe He Pro 
110 

Glu Pro Trp Phe 
125 

Leu Leu Ala Pro 



Glu Ser Thr Ala 
160 

Asn Gin Gly Glu 
175 

Gly Gly Phe Tyr 
190 

Leu Val Arg His 
205 

Ser Arg Pro Cys 



Glu Trp Glu Val 
240 

Ala Gly Gin Phe 
255 

Lys Val Ala Val 
270 

Phe Leu Ala Glu 
285 

Val Arg Leu Tyr 



Glu Tyr Met Glu 
320 

Gly He Lys Leu 
335 

Ala Glu Gly Met 
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340 



345 



350 



Ala Phe lie Glu 
355 

Asn lie Leu Val 
370 

Leu Ala Arg Leu 
385 

Phe Trp Asp Pro 



Pro Arg Asp Asp 
420 

Pro His Gly Ala 
435 



Glu Arg Asn Tyr 
360 

Ser Asp Thr Leu 
375 

lie Glu Asp lie 
390 

Ala Asp Gly Asn 
405 

Gin Pro Gly Gly 



Pro 



lie His Arg Asp 



Ser Cys Lys lie 
380 

His His Gin Val 
395 

Cys His Pro Arg 
410 

Asp Ser Glu Pro 
425 



Leu Arg Ala Ala 
365 

Ala Asp Phe Gly 



Arg Cys Val Val 
400 

Pro His Pro Leu 
415 

Gly Ala Arg Leu 
430 



<210> 40 
<211> 567 
<212> PRT 

<213> Homo sapiens 
<400> 40 . 

Met Gly Cys Gly Cys Ser Ser His Pro Glu Asp Asp Trp Met Glu Asn 
15 10 15 

lie Asp Val Cys Glu Asn Cys His Tyr Pro lie Val Pro Leu Asp Gly 
20 25 30 

Lys Gly Thr Leu Leu lie Arg Asn Gly Ser Glu Val Arg Asp Pro Leu 
35 40 45 

Val Thr Tyr Glu Gly Ser Asn Pro Pro Ala Ser Pro Leu Gin Gly Asp 
50 55 60 

Pro Arg Gin Gin Gly Leu Lys Asp Lys Ala Cys Gly Ser Leu Ala Val 
65 70 75 80 

Gly Phe His Leu Ser Pro Thr Tyr Phe Leu Pro Gly Leu Ala Phe Leu 
85 90 95 

Val Pro His Pro Val Thr Pro Gly Phe Leu Pro lie Pro Ala Arg Phe 
100 105 110 

Ser Leu Thr Pro Leu Val Phe Thr. Asp Asn Leu Val lie Ala Leu His 
115 120 125 

Ser Tyr Glu Pro Ser His Asp Gly Asp Leu Gly Phe Glu Lys Gly Glu 
130 135 140 

Gin Leu Arg lie Leu Glu Gin Ser Gly Glu Trp Trp Lys Ala Gin Ser 
145 150 155 160 

Leu Thr Thr Gly Gin Glu Gly Phe lie Pro Phe Asn Phe Val Ala Lys 
165 170 175 
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Ala Asn Ser Leu Glu Pro Glu Pro Trp Phe Phe Lys Asn Leu Ser Arg 
180 185 190 

Lys Asp Ala Glu Arg Gin Leu Leu Ala Pro Gly Asn Thr His Gly Ser 
195 200 205 

Phe Leu lie Arg Glu Ser Glu Ser Thr Ala Gly Ser Phe Ser Leu Ser 
210 ~ 215 220 

Val Arg Asp Phe Asp Gin Asn Gin Gly Glu Val Val Lys His Tyr Lys 
225 230 ' 235 240 

lie Arg Asn Leu Asp Asn Gly Gly Phe Tyr lie Ser Pro Arg lie Thr 
245 250 255 

Phe Pro Gly Leu His Glu Leu Val Arg His Tyr Thr Asn Ala Ser Asp 
260 265 270 

Gly Leu Cys Thr Arg Leu Ser Arg Pro Cys Gin Thr Gin Lys Pro Gin 
275 280 285 

Lys Pro Trp Trp Glu Asp Glu Trp Glu Val Pro Arg Glu Thr Leu Lys 
290 295 300 

Leu Val Glu Arg Leu Gly Ala Gly Gin Phe Gly Glu Val Trp Met Gly 
305 310 315 320 

Tyr Tyr Asn Gly His Thr Lys Val Ala Val Lys Ser Leu Lys Gin Gly 
325 330 335 

Ser Met Ser Pro Asp Ala Phe Leu Ala Glu Ala Asn Leu Met Lys Gin 
340 345 350 

Leu Gin His Gin Arg Leu Val Arg Leu Tyr Ala Val Val Thr Gin Glu 
355 360 365 

Pro lie Tyr lie lie Thr Glu Tyr Met Glu Asn Gly Ser Leu Val Asp 
370 375 380 

Phe Leu Lys Thr Pro Ser Gly lie -Lys Leu Thr lie Asn Lys Leu Leu 
385 390 395 400 

Asp Met Ala Ala Gin lie Ala Glu Gly Met Ala Phe lie Glu Glu Arg 
405 410 415 

Asn Tyr lie His Arg Asp Leu Arg Ala Ala Asn lie Leu Val Ser Asp 
420 425 430 

Thr Leu Ser Cys Lys lie Ala Asp Phe Gly Leu Ala Arg Leu lie Glu 
435 440 445 

Asp Asn Glu Tyr Thr Ala Arg Glu Gly Ala Lys Phe Pro He Lys Trp 
450 455 460 

Thr Ala Pro Glu Ala He Asn Tyr Gly Thr Phe Thr lie Lys Ser Asp 
465 470 475 480 

Val Trp Ser Phe Gly lie Leu Leu Thr Glu He Val Thr His Gly Arg 
485 490 495 

lie Pro Tyr Pro Gly Met Thr Asn Pro Glu Val He Gin Asn Leu Glu 
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500 

Arg Gly Tyr Arg 
515 

Gin Leu Met Arg 
530 

Phe Asp Tyr Leu 
545 

Gly Gin Tyr Gin 



Met Val Arg Pro 
520 

Leu Cys Trp Lys 
535 



Arg Ser Val Leu 
550 

Pro Gin Pro 
565 



505 

Asp Asn Cys Pro 

Glu Arg Pro Glu 
540 

Glu Asp Phe Phe 
555 



510 

Glu Glu Leu Tyr 
525 

Asp Arg Pro Thr 

Thr Ala Thr Glu 
560 



<210> 41 
<211> 192 
<212> PRT 

<213> Homo sapiens 
<400> 41 

Met Arg lie Ala Val lie Cys Phe Cys Leu Leu Gly lie Thr Cys Ala 
15 10 15 

lie Pro Val Lys Gin Ala Asp Ser Gly Ser Ser Glu Glu Lys Gin Leu 
20 25 30 

Tyr Asn Lys Tyr Pro Asp Ala Val Ala Thr Trp Leu Asn Pro Asp Pro 
35 40 45 

Ser Gin Lys Gin Asn Leu Leu Ala Pro Gin Asn Ala Val Ser Ser Glu 
50 55 60 

Glu Thr Asn Asp Phe Lys Gin Glu Thr Leu Pro Ser Lys Ser Asn Glu 
.65 70 75 80 

Ser His Asp His Met Asp Asp Met Asp Asp Glu Asp Asp Asp Asp His 
85 90. 95 

Val Asp Ser Gin Asp Ser lie Asp Ser Asn Asp Ser Asp Asp Val Asp 
100 105 110 

Asp Thr Asp Asp Ser His Gin Ser Asp Glu Ser His His Ser Asp Glu 
115 120 125 

Ser Asp Glu Leu Val Thr Asp Phe Pro Thr Asp Leu Pro Ala Thr Glu 
130 135 140 

Val Phe Thr Pro Val Val Pro Thr Val Asp Thr Tyr Asp Gly Arg Gly 
145 150 155 160 

Asp Ser Val Val Tyr Gly Leu Arg Ser Lys Ser Lys Lys Phe Arg Arg 
165 170 175 

Pro Asp lie Gin Val Asn Pro Leu Thr Asp Thr Pro Asp Gly Ser Asp 
180 185 190 
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<210> 42 
<211> 109 
<212> PRT 

<213> Homo sapiens 
<400> 42 

Met Glu Leu Gly Leu Pro Gin Val Pro Pro Ala Val Asp Ala Glu Leu 
1 5 10 15 

Leu Cys Arg Phe Val Asp Arg Gly Leu Pro Tyr Pro Asp Val Ser Ser 
20 25 30 



Ala Asn Thr Pro Pro 
35 

Glu Pro Cys Ala Leu 
50 

lie Pro Cys Thr Leu 
65 

Ser His Gly Leu Val 
85 

Asn Asn Gin Trp Val 
100 



Ala Val Gly Leu Ser 
40 

Pro Ser Pro His Arg 
55 

Ala Val Ser Asn Pro 

• 70 

Glu Val Ala Ser Tyr 
90 

Pro Tyr lie Ser Leu 
105 



Pro Pro Thr Pro Tyr Phe 
45 

His Gin Leu Ala Glu Ala 
60 

His Thr Asp Ala Trp Lys 
75 80 

Cys Glu Glu Ser Arg Gly 
95 

Gin Glu Arg 



<210> 43 
<211> 331 
<212> PRT 

<213> Homo sapiens 
<400> 43 

Met Arg Ala Arg Pro Gin Val Cys Glu Ala Leu Leu Phe Ala Leu Ala 
1 5 10 15 

Leu Gin Thr Gly Val Cys Tyr Gly lie Lys Trp Leu Ala Leu Ser Lys 
20 25 30 



Thr Pro Ser Ala Leu Ala Leu Asn Gin Thr Gin His Cys Lys Gin Leu 

35 40 45 

Glu Gly Leu Val Ser Ala Gin Val Gin Leu Cys Arg Ser Asn Leu Glu 

50 55 60 

Leu Met His Thr Val Val His Ala Ala Arg Glu Val Met Lys Ala Cys 

65 70 75 80 

Arg Arg Ala Phe Ala Asp Met Arg Trp Asn Cys Ser Ser lie Glu Leu 

85 90 95 

Ala Pro Asn Tyr Leu Leu Asp Leu Glu Arg Gly Thr Arg Glu Ser Ala 

100 105 110 

Phe Val Tyr Ala Leu Ser Ala Ala Ala lie Ser His Ala lie Ala Arg 

115 120 125 
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Ala Cys Thr Ser Gly Asp Leu Pro Gly Cys Ser Cys Gly Pro Val Pro 
130 135 140 



Gly Glu Pro Pro Gly Pro Gly Asn Arg Trp Gly Arg Cys Ala Asp Asn 
145 150 155 160 

Leu Ser Tyr_ Gly Leu Leu Met Gly Ala Lys Phe Ser Asp Ala Pro Met 
165 170 175 

Lys Val Lys Lys Thr Gly Ser Gin Ala Asn Lys Leu Met Arg Leu His 
180 185 190 

Asn Ser Glu Val Gly Arg Gin Ala Leu Arg Ala Ser Leu Glu Met Lys 
195 200 205 

Cys Lys Cys His Gly Val Ser Gly Ser Cys Ser lie Arg Thr Cys Trp 
210 215 220 

Lys Gly Leu Gin Glu Leu Gin Asp Val Ala Ala Asp Leu Lys Thr Arg 
225 230 235 240 

Tyr Leu Ser Ala Thr Lys Val Val His Arg Pro Met Gly Thr Arg Lys 
245 250 255 

His Leu Val Pro Lys Asp Leu Asp lie Arg Pro Val Lys Asp Ser Glu 
260 265 270 

Leu Val Tyr Leu Gin Ser Ser Pro Asp Phe Cys Met Lys Asn Glu Lys 
275 280 285 

Val Gly Ser His. Gly Thr Gin Asp Arg Gin Cys Asn Lys Thr Ser Asn 
290 295 300 

Gly Ser Asp Ser Cys Asp Leu Met Cys Cys Tyr Val Thr Cys Arg Arg 
305 310 315 320 

Cys Glu Arg Thr Val Glu Arg Tyr Val Cys Lys 
325 330 



<210> 44 
<211> 237 
<212> PRT 

<213> Homo sapiens 
<400> 44 

Met Arg Ala Arg Pro Gin Val Cys Glu Ala Leu Leu Phe Ala Leu Ala 
1 5 10 15 

Leu Gin Thr Gly Val Cys Tyr Gly lie Lys Trp Leu Ala Leu Ser Lys 
. 20 25 30 

Thr Pro Ser Ala Leu Ala Leu Asn Gin Thr Gin His Cys Lys Gin Leu 
35 40 45 

Glu Gly Leu Val Ser Ala Gin Val Gin Leu Cys Arg Ser Asn Leu Glu 
50 55 60 

Leu Met His Thr Val Val His Ala Ala Arg Glu Val Met Lys Ala Cys 
65 70 75 80 
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Arg Arg Ala Phe Ala 
8 5 

Ala Pro Asn Tyr Leu 
100 

Phe Val Tyr Ala Ala 
115 

Lys Val Val His Arg 
130 

Asp Leu Asp lie Arg 
145 

Ser Ser Pro Asp Phe 
165 

Thr Gin Asp Arg Gin 
180 

Asp Leu Met Cys Cys 
195 

Val Glu Arg Cys His 
210 

Arg Arg Cys Glu Arg 

225 . 



Asp Met Arg Trp Asn Cys 
90 

Leu Asp Leu Glu Arg Gly 
105 

Ala Asp Leu Lys Thr Arg 
120 

Pro Met Gly Thr Arg Lys 
135 

Pro Val Lys Asp Ser Glu 
150 155 

Cys Met Lys Asn Glu Lys 
170 

Cys Asn Lys Thr Ser Asn 
185 

Gly Arg Gly Tyr Asn Pro 
200 

Cys Lys Tyr His Trp Cys 
215 

Thr Val Glu Arg Tyr Val 
230 235 



Ser Ser lie Glu Leu 
95 

Thr Arg Glu Ser Ala 
110 

Tyr Leu Ser Ala Thr 
125 

His Leu Val Pro Lys 
140 

Leu Val Tyr Leu Gin 
160 

Val Gly Ser His Gly 
175 

Gly Ser Asp Ser Cys 
190 

Tyr Thr Asp Arg Val 
205 

Cys Tyr Val Thr Cys 
220 

Cys Lys 



<210> 45 
<211> 615 
<212> PRT 

<213> Homo sapiens 
<400> 45 

Met Ser Pro Phe Leu Arg lie Gly Leu Ser Asn Phe Asp Cys Gly Ser 
15 10 15 

Cys Gin Ser Cys Gin Gly Glu Ala Val Asn Pro Tyr Cys Ala Val Leu 
20 25 30 

Val Lys Glu Tyr Val Glu Ser Glu Asn Gly Gin Met Tyr lie Gin Lys 
35 40 45 

Lys Pro Thr Met Tyr Pro Pro Trp Asp Ser Thr Phe Asp Ala His lie 
50 55 60 

Asn Lys Gly Arg Val Met Gin lie lie Val Lys Gly Lys Asn Val Asp 
65 70 75 80 

Leu lie Ser Glu Thr Thr Val Glu Leu Tyr Ser Leu Ala Glu Arg Cys 
85 90 95 

Arg Lys Asn Asn Gly Lys Thr Glu lie Trp Leu Glu Leu Lys Pro Gin 
100 105 110 

Gly Arg Met Leu Met Asn Ala Arg Tyr Phe Leu Glu Met Ser Asp Thr 
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115 



120 



125 



Lys Asp Met Asn 
130 

Arg Arg Gly Ala 
145 

Glu Phe Thr Ala 



His Glu Phe Val 
180 

Cys Asn Ala Ala 

195 . 

Cys Thr Gly Ser 
210 

Arg Phe Lys -lie 
225 

Ser Pro Thr Phe 



Arg Gin Gly Leu 
260 

Cys Gin Thr Lys 
275 

Ala Glu Ala Leu 
290 

Arg Asp Thr Glu 
305 

Pro Cys Ser lie 



Gly Lys Arg Glu 
34.0 

Val Asp Lys Met 
355 

Pro Ser Leu Gin 
370 

Met Leu Gly Lys 
385 

Lys Thr Asn Gin 



Leu Met Asp Asp 
420 

Ser Leu Ala Trp 
435 



Glu Phe Glu Thr 
135 

lie Lys Gin Ala 
150 

Thr Phe Phe Pro 
165 

Trp Gly Leu Asn 



lie His Lys Lys 
200 

Ala lie Asn Ser 
215 

Asp Met Pro His 
230 

Cys Glu His Cys 
245 

Lys Cys Asp Ala 



Val Ala Asn Leu 
280 

Ala Met lie Glu 
295 

Gin lie Phe Arg 
310 

Lys Asn Glu Ala 
325 

Pro Gin Gly lie 



Cys His Leu Pro 
360 

lie Lys Leu Lys 
375 

Gly Ser Phe Gly 
390 

Phe Phe Ala lie 
405 

Asp Val Glu Cys 



Glu His Pro Phe 
440 



Glu Gly Phe Phe 
140 

Lys Val His His 
155 

Gin Pro Thr Phe 
170 

Lys Gin Gly Tyr 
185 

Cys lie Asp Lys 



Arg Glu Thr Met 
220 

Arg Phe Lys Val 
235 

Gly Thr Leu Leu 
250 

Cys Gly Met Asn 
265 

Cys Gly lie Asn 



Ser Thr Gin Gin 
300 

Glu Gly Pro Val 
315 

Arg Pro Pro Cys 
330 

Ser Trp Glu Ser 
345 

Glu Pro Glu Leu 



lie Glu Asp Phe 
380 

Lys Val Phe Leu 
395 

Lys Ala Leu Lys 
410 

Thr Met Val Glu 
425 

Leu Thr His Met 



Ala Leu His Gin 



Val Lys Cys His 
160 

Cys Ser Val Cys 
175 

Gin Cys Arg Gin 
190 

Val lie Ala Lys 
205 

Phe His Lys Glu 



Tyr Asn Tyr Lys 
240 

Trp Gly Leu Ala 
255 

Val His His Arg 
270 

Gin Lys Leu Met 
285 

Ala Arg Cys Leu 



Glu lie Gly Leu 
320 

Leu Pro Thr Pro 
335 

Pro Leu Asp Glu 
350 

Asn Lys Glu Arg 
365 

lie Leu His Lys 



Ala Glu Phe Lys 
400 

Lys Asp Val Val 
415 

Lys Arg Val Leu 
430 

Phe Cys Thr Phe 
445 
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Gin Thr Lys Glu Asn Leu Phe Phe Val Met Glu Tyr Leu Asn Gly Gly 
450 455 460 

Asp Leu Met Tyr His lie Gin Ser Cys His Lys Phe Asp Leu Ser Arg 
465 470 475 480 

Ala Thr Phe Tyr Ala Ala Glu lie lie Le'u Gly Leu Gin Phe Leu His 
485 490 495 

Ser Lys Gly lie Val Tyr Arg Asp Leu- Lys Leu Asp Asn lie Leu Leu 
500 505 510 

Asp Lys Asp Gly His lie Lys lie Ala Asp Phe Gly Met Cys Lys Glu 
515 520 525 

Asn Met Leu Gly Asp Ala Lys Thr Asn Thr Phe Cys Gly Thr Pro Asp 
530 535 540 

Tyr lie Ala Pro Glu lie Leu Leu Gly Gin Lys Tyr Asn His Ser Val 
545 550 555 560 

Asp Trp Trp Ser Phe Gly Val Leu Leu Tyr Glu Met Leu lie Gly Gin 
565 570 575 

Ser Pro Phe His Gly Gin Asp Glu Glu Glu Leu Phe His Ser lie Arg 
580 585 590 

Met Asp Asn Pro Phe Tyr Pro Arg Trp Leu Glu Lys Glu Ala Lys Asp 
595 600 605 

Leu Leu Val Lys Val Arg Ser 
610 615 



<210> 46 
<211> 292 
<212> PRT 

<213> Homo sapiens 
<400> 46 

Met Pro lie Thr Arg Met Arg Met Arg Pro Trp Leu Glu Met Gin lie 
1 5 10 15 

Asn Ser Asn Gin lie Pro Gly Leu lie Trp He Asn Lys Glu Glu Met 
20 25 30 

.He Phe Gin lie Pro Trp Lys His Ala Ala Lys His Gly Trp Asp He 
35 40 45 

Asn Lys Asp Ala Cys Leu Phe Arg Ser Trp Ala He His Thr Gly Arg 
50 55 60 

Tyr Lys Ala Gly Glu Lys Glu Pro Asp Pro Lys Thr Trp Lys Ala Asn 
65 70 75 80 

Phe Arg Cys Ala Met Asn Ser Leu Pro Asp He Glu Glu Val Lys Asp 
85 90 95 

Gin Ser Arg Asn Lys Gly Ser Ser Ala Val Arg Val Tyr Arg Met Leu 
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100 



105 



110 



Pro Pro Leu Thr Lys Asn Gin Arg Lys Glu Arg Lys Ser Lys Ser Ser 
115 120 125 

Arg Asp Ala Lys Ser Lys Ala Lys Arg Lys Ser Cys Gly Asp Ser Ser 
130 135 140 

Pro Asp Thr Phe Ser Asp Gly Leu Ser Ser Ser Thr Leu Pro Asp Asp 
145 150 155 160 

His Ser Ser Tyr Thr Val Pro Gly Tyr Met Gin Asp Leu Glu Val Glu 
165 170 175 

Gin Ala Leu Thr Pro Ala Leu Ser Pro Cys Ala Val Ser Ser Thr Leu 
180 185 190 

Pro Asp Trp His He Pro Val Glu Val Val Pro Asp Ser Thr Ser Asp 
195 200 205 

Leu Tyr Asn Phe Gin Val Ser Pro Met Pro Ser Thr Ser Glu Ala Thr 
210 215 220 

Thr Asp Glu Asp Glu Glu Gly Lys Leu Pro Glu Asp He Met Lys Leu 
225 230 235 240 

Leu Glu Gin Ser Glu Trp Gin Pro Thr Asn Val Asp Gly Lys Gly Tyr 
245 250 255 

Leu Leu Asn Glu Pro Gly Val Gin Pro Thr Ser Val Tyr Gly Asp Phe 
260 265 270 

Ser Cys Lys Glu Glu Pro Glu He Asp Ser Pro Gly Gly Lys Lys Ala 
275 280 285 

Pro Gly Ser Leu 
290 



<210> 47 
<211> 702 
<212> PRT 

<213> Homo sapiens 
<400> 47 

Met Trp Ser Trp Lys Cys Leu Leu Phe Trp Ala Val Leu Val Thr Ala 
15 10 15 

Thr Leu Cys Thr Ala Arg Pro Ser Pro Thr Leu Pro Glu Gin Ala Gin 
20 25 30 

Pro Trp Gly Ala Pro Val Glu Val Glu Ser Phe Leu Val His Pro Gly 
35 40 45 

Asp Leu Leu Gin Leu Arg Cys Arg Leu Arg Asp Asp Val Gin Ser lie 
50 55 60 

Asn Trp Leu Arg Asp Gly Val Gin Leu Ala Glu Ser Asn Arg Thr Arg 
65 70 75 80 
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lie Thr Gly Glu Glu Val Glu Val Gin Asp Ser Val Pro Ala Asp Ser 
85 90 95 



Gly Leu Tyr Ala 
100 

Tyr Phe Ser Val 
115 

Asp Asp Asp Asp 
130 

Lys Pro Asn Arg 
145 

Met Glu Lys Lys 



Lys Cys Pro Ser 
180 

Asn Gly Lys Glu 
195 

Arg Tyr Ala Thr 
210 

Lys Gly Asn Tyr 
225 

His Thr Tyr Gin 



Cys Val Thr Ser 



Asn Val Ser Asp 
120 

Asp Ser Ser Ser 
135 

Met Pro Val Ala 
150 

Leu His Ala Val 
165 

Ser Gly Thr Pro 



Phe Lys Pro Asp 
200 

Trp Ser lie lie 
215 

Thr Cys lie Val 
230 

Leu Asp Val Val 
245 



Ser Pro Ser Gly 
105 

Ala Leu Pro Ser 



Glu Glu Lys Glu 
140 

Pro Tyr Trp Thr 
155 

Pro Ala Ala Lys 
170 

Asn Pro Thr Leu 
185 

His Arg He Gly 



Met Asp Ser Val 
220 

Glu Asn Glu Tyr 
235 

Glu Arg Ser Pro 
250 



Ser Asp Thr Thr 
110 

Ser Glu Asp Asp 
125 

Thr Asp Asn Thr 



Ser Pro Glu Lys 
160 

Thr Val Lys Phe 
175 

Arg Trp Leu Lys 
190 

Gly Tyr Lys Val 
205 

Val Pro Ser Asp 



Gly Ser He Asn 
240 

His Arg Pro He 
255 



Leu Gin Ala Gly Leu Pro Ala Asn Lys Thr Val Ala Leu Gly Ser Asn 
260 265 270 

Val Glu Phe Met Cys Lys Val Tyr Ser Asp Pro Gin Pro His He Gin 
275 280 285 

Trp Leu Lys His He Glu Val Asn Gly Ser Lys He Gly Pro Asp Asn 
290 295 300 

Leu Pro Tyr Val Gin He Leu Lys Thr Ala Gly Val Asn Thr Thr Asp 
305 310 315 320 

Lys Glu Met Glu Val Leu His Leu Arg Asn Val Ser Phe Glu Asp Ala 
325 330 335 

Gly Glu Tyr Thr Cys Leu Ala Gly Asn Ser He Gly Leu Ser His His 
340 345 350 

Ser Ala Trp Leu Thr Val Leu Glu Ala Leu Glu Glu Arg Pro Ala Val 
355 360 365 

Met Thr Ser Pro Leu Tyr Leu Glu He He He Tyr Cys Thr Gly Ala 
370 375 380 

Phe Leu He Ser Cys Met Val Gly Ser Val He Val Tyr Lys Met Lys 
385 390 395 400 

Ser Gly Thr Lys Lys Ser Asp Phe His Ser Gin Met Ala Val His Lys 
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405 410 415 

Leu Ala Lys Ser He Pro Leu Arg Arg Gin Val Thr Val Ser Ala Asp 
420 425 430 

Ser Ser Ala Ser Met Asn Ser Gly Val Leu Leu Val Arg Pro Ser Arg 
435 440 445 

Leu Ser Ser Ser Gly Thr Pro Met Leu Ala Gly Val Ser Glu Tyr Glu 
450 455 460 

Leu Pro Glu Asp Pro Arg Trp Glu Leu Pro Arg Asp Arg Leu Val Leu 
465 470 475 480 

Gly Lys Pro Leu Gly Glu Gly Cys Phe Gly Gin Val Val Leu Ala Glu 
485 490 495 

Ala He Gly Leu Asp Lys Asp Lys Pro Asn Arg Val Thr Lys Val Ala 
500 505 510 

Val Lys Met Leu Lys Ser Asp Ala Thr Glu Lys Asp Leu Ser Asp Leu 
515 520 525 

lie Ser Glu Met Glu Met Met Lys Met He Gly Lys His Lys Asn He 
530 535 540 

He Asn Leu Leu Gly Ala Cys Thr Gin Asp Gly Pro Leu Tyr Val He 
545 550 555 560 

Val Glu Tyr Ala Ser Lys Gly Asn Leu Arg Glu Tyr Leu Gin Ala Arg 
565 570 575 

Arg Pro Pro Gly Leu Glu Tyr Cys Tyr Asn Pro Ser His Asn Pro Glu 
580 585 590 

Glu Gin Leu Ser Ser Lys Asp Leu Val Ser Cys Ala Tyr Gin Val Ala 
595 600 605 

Arg Gly Met Glu Tyr Leu Ala Ser Lys Lys • Cys lie His Arg Asp Leu 
610 615 620 

Ala Ala Arg Asn Val Leu Val Thr Glu Asp Asn Val Met Lys He Ala 
625 630 635 640 

Asp Phe Gly Leu Ala Arg Asp He His His He Asp Tyr Tyr Lys Lys 
645 650 655 

Thr Thr Asn Gly Arg Leu Pro Val Lys Trp Met Ala Pro Glu Ala Leu 
660 665 670 

Phe Asp Arg He Tyr Thr His Gin Ser Asp Val Trp Ser Phe Gly Val 
675 680 685 

Pro His Thr Pro Val Cys Leu Trp Arg Asn Phe Ser Ser Cys 
690 695 700 



<210.> 48 
<211> 295 
<212> PRT 
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<213> Homo sapiens 
<400> 48 

Met Pro Lys Arg Gly Lys Lys Gly Ala Val Ala Glu Asp Gly Asp Glu 
1 5 10 15 

Leu Arg Thr Glu Pro Glu Ala Lys Lys Ser Lys Thr Ala Ala Lys Lys 
20 25 30 

Asn Asp Lys Glu Ala Ala Gly Glu Gly Pro Ala Leu Tyr Glu Asp Pro 
35 40 45 

Pro Asp Gin Lys Thr Ser Pro Ser Gly Lys Pro Ala Thr Leu Lys lie 
50 55 60 

Cys Ser Trp Asn Val Asp Gly Leu Arg Ala Trp lie Lys Lys Lys Gly 
65 70 75 80 

Leu Asp Trp- Val Lys Glu Glu Ala Pro Asp lie Leu Cys Leu Gin Glu 
85 90 95 

Thr Lys Cys Ser Glu Asn Lys Leu Pro Ala Glu Leu Gin Glu Leu Pro 
100 105 110 

Gly Leu Ser His Gin Tyr Trp Ser Ala Pro Ser Asp Lys Glu Gly Tyr 
115 120 125 

Ser Gly Val Gly Leu Leu Ser Arg Gin Cys Pro Leu Lys Val Ser Tyr 
130 135 140 

Gly lie Ala Tyr Val Pro Asn Ala Gly Arg Gly Leu Val Arg Leu Glu 
145 150 155 160 

Tyr Arg Gin Arg Trp Asp Glu Ala Phe Arg Lys Phe Leu Lys Gly Leu 
165 170 175 

Ala Ser Arg Lys Pro Leu Val Leu Cys Gly Asp Leu Asn Val Ala His 
180 185 190 

Glu Glu lie Asp Leu Arg Asn Pro Lys Gly Asn Lys Lys Asn Ala Gly 
195 200 205 

Phe Thr Pro Gin Glu Arg Gin Gly Phe Gly Glu Leu Leu Gin Ala Val 
210 215 220 

Pro Leu Ala Asp Ser Phe Arg His Leu Tyr Pro Asn Thr Pro Tyr Ala 
225 230 235 240 

Tyr Thr Phe Trp Thr Tyr Met Met Asn Ala Arg Ser Lys Asn Val Gly 
245 250 255 

Trp Arg Leu Asp Tyr Phe Leu Leu Ser His Ser Leu Leu Pro Ala Leu 
260 265 270 

Cys Asp Ser Lys lie Arg Ser Lys Ala Leu Gly Ser Asp His Cys Pro 
275 280 285 

lie Thr Leu Tyr Leu Ala Leu 
290 295 
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<210> 49 
<211> 342 
<212> PRT 

<213> Homo sapiens 



<400> 49 

Met Pro Lys Arg Gly Lys Lys Gly Ala Val Ala Glu Asp Gly Asp Glu 
15 10 15 

Leu Arg Thr Gly Lys Gly Met Lys Ser Ala Leu Leu Pro Arg Asn Cys 
20 25 30 

Gly Gly Gly Val Cys His Ser Leu Asp Val Arg Glu Pro Glu Ala Lys 
35 40 45 

Lys Ser Lys Thr Ala Ala Lys Lys Asn Asp Lys Glu Ala Ala Gly Glu 
50 55 60 

Gly Pro Ala Leu Tyr Glu Asp Pro Pro Asp Gin Lys Thr Ser Pro Ser 
65 70 ; 75 80 

Gly Lys Pro Ala Thr Leu Lys He Cys Ser Trp Asn Val Asp Gly Leu 
85 90 95 

Arg Ala Trp He Lys Lys Lys Gly Leu Asp Trp Val Lys Glu Glu Ala 
100 105 110 

Pro Asp He Leu Cys Leu Gin Glu Thr Lys Cys Ser Glu Asn Lys Leu 
115 120 125 

Pro Ala Glu Leu Gin Glu Leu Pro Gly Leu Ser His Gin Tyr Trp Ser 
130 135 140 

Ala Pro Ser Asp Lys Glu Gly Tyr Ser Gly Val Gly Leu Leu Ser Arg 
145 150 155 160 

Gin Cys Pro Leu Lys Val Ser Tyr Gly He Gly Asp Glu Glu His Asp 
165 170- 175 

Gin Glu Gly Arg Val He Val Ala Glu Phe Asp Ser Phe Val Leu Val 
180 185 190 

Thr Ala Tyr Val Pro Asn Ala Gly Arg Gly Leu Val Arg Leu Glu Tyr 
195 200 205 

Arg Gin Arg Trp Asp Glu Ala Phe Arg Lys Phe Leu Lys Gly Leu Ala 
210 215 220 

Ser Arg Lys Pro Leu Val Leu Cys Gly Asp Leu Asn Val Ala His Glu 
225 230 235 240 

Glu He Asp Leu Arg Asn Pro Lys Gly Asn Lys Lys Asn Ala Gly Phe 
245 250 255 

Thr Pro Gin Glu Arg Gin Gly Phe Gly Glu Leu Leu Gin Ala Val Pro 
260 265 270 

Leu Ala Asp Ser Phe Arg His Leu Tyr Pro Asn Thr Pro Tyr Ala Tyr 
275 280 285 
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Thr Phe Trp Thr Tyr 
290 

Arg Leu Asp Tyr Phe 
305 

Asp Ser Lys He Arg 
325 

Thr Leu Tyr Leu Ala 
340 



Met Met Asn Ala Arg Ser 
295 

Leu Leu Ser His Ser Leu 
310 315 

Ser Lys Ala Leu Gly Ser 
330 

Leu 



Lys Asn Val Gly Trp 
300 

Leu Pro Ala Leu Cys 
320 

Asp His Cys Pro He 

3351 



<210> 50 
<211> 305 
<212> PRT 

<213> Homo sapiens 
<400> 50 

Met Phe Gin Ala Ala Glu Arg Pro Gin Glu Trp Ala Met Glu Gly Pro 
1 5 10 15 

Arg Asp Gly Leu Lys Lys Glu Arg Leu Leu Asp Asp Arg His Asp Ser 
20 25 30 

Gly Leu Asp Ser Met Lys Asp Glu Glu Tyr Glu Gin Met Val Lys Glu 
35 40 45 

Leu Gin Glu lie Arg Leu Glu Pro Gin Glu Val Pro Arg Gly Ser Glu 
50 55 60 

Pro Trp Lys Gin Gin Leu Thr Glu Asp Gly Asp Ser Phe Leu His Leu 
65 70 75 80 

Ala He lie His Glu Glu Lys Ala Leu Thr Met Glu Val He Arg Gin 
85 90 95 

Val Lys Gly Asp Leu Ala Phe Leu Asn Phe Gin Asn Asn Leu Gin Gin 
100 105 110 

Thr Pro Leu His Leu' Ala Val He Thr Asn Gin Pro Glu He Ala Glu 
115 120 125 

Ala Leu Leu Gly Ala Gly Cys Asp Pro Glu Leu Arg Asp Phe Arg Gly 
130 135 140 

Asn Thr Pro Leu His Leu Ala Cys Glu Gin Gly Cys Leu Ala Ser Val 
145 150 155 160 

Gly Val Leu Thr Gin Ser Cys Thr Thr Pro His Leu His Ser He Leu 
165 170 175 

Lys Ala Thr Asn Tyr Asn Gly His Thr Cys Leu His Leu Ala Ser He 
180 185 190 

His Gly Tyr Leu Gly He Val Glu Leu Leu Val Ser Leu Gly Ala Asp 
195 200 205 

Val Asn Ala Gin Glu Pro Cys Asn Gly Arg Thr Ala Leu His Leu Ala 
210 215 220 
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Val Asp Leu Gin 
225 

Ala Asp Val Asn 



Thr Trp Gly Arg 
260 

Thr Leu Glu Asn 
275 

Tyr Asp Thr Glu 
290 



Asn Pro Asp Leu 
230 

Arg Val Thr Tyr 
245 

Pro Ser Thr Arg 



Leu Gin Met Leu 
280 

Ser Glu Phe Thr 
295 



Val Ser Leu Leu 
235 

Gin Gly Tyr Ser 
250 

lie Gin Gin Gin 
265 

Pro Glu Ser Glu 



Glu Phe Thr Glu 
300 



Leu Lys Cys Gly 
240 

Pro Tyr Gin Leu 
255 

Leu Gly Gin Leu 
270 

Asp Glu Glu Ser 
285 

Asp Glu Val Ser 



Leu 
305 



<210> 51 

<211> 289 

<212> PRT 

<213> Homo sapiens 

<400> 51 

Met Phe Gin Ala Ala Glu Arg Pro Gin Glu Trp Ala Met Glu Gly Pro 

1.5 10 15 

Arg Asp Gly Leu Lys Lys Glu Arg Leu Leu Asp Asp Arg His Asp Ser 
20 25 30 

Gly Leu Asp Ser Met Lys Asp Glu Glu Tyr Glu Gin Met Val Lys Glu 
35 40 45 

Leu Gin Glu lie Arg Leu Glu Pro Gin Glu Val Pro Arg Gly Ser Glu 
50 55 60 

Pro Trp Lys Gin Gin Leu Thr Glu Asp Gly Asp Ser Phe Leu His Leu 
65 70 75 80 

Ala lie lie His Glu Glu Lys Ala Leu Thr Met Glu Val lie Arg Gin 
85 90 95 

Val Lys Gly Asp Leu Ala Phe Leu Asn Phe Gin Asn Asn Leu Gin Gin 
100 105 110 

Thr Pro Leu His Leu Ala Val lie Thr Asn Gin Pro Glu lie Ala Glu 
115 120 125 

Ala Leu Leu Gly Ala Gly Cys Asp Pro Glu Leu Arg Asp Phe Arg Gly 
130 135 140 

Asn Thr Pro Leu His Leu Ala Cys Glu Gin Gly Cys Leu Ala Ser Val 
145 150 155 160 

Gly Val Leu Thr Gin Ser Cys Thr Thr Pro His Leu His Ser lie Leu 
165 170 175 

Lys Ala Thr Asn Tyr Asn Gly Gin Glu Pro Cys Asn Gly Arg Thr Ala 
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180 



185 



190 



Leu His Leu Ala 
195 

Leu Lys Cys Gly 
210 

Pro Tyr Gin Leu 
225 

Leu Gly Gin Leu 



Asp Glu Glu Ser 
260 

Asp Glu Leu Pro 
275 

Leu 



Val Asp Leu Gin 
200 

Ala Asp Val Asn 
215 

Thr Trp Gly Arg 
230 

Thr Leu Glu Asn 
245 

Tyr Asp Thr Glu 



Tyr Asp Asp Cys 
280 



Asn Pro Asp Leu 



Arg Val Thr Tyr 
220 

Pro Ser Thr Arg 
235 

Leu Gin Met Leu 
250 

Ser Glu Phe Thr 

265 " 

Val Phe Gly Gly 



Val Ser Leu Leu 
205 

Gin Gly Tyr Ser 



lie Gin Gin Gin 
240 

Pro Glu Ser Glu 
255 

Glu Phe Thr Glu 
270 

Gin Arg Leu Thr 
285 



<210> 52 
<211> 921 
<212> FRT 

<213> Homo sapiens 
<400> 52 

Met Ala Gly lie Phe Tyr Phe Ala Leu Phe Ser Cys Leu Phe Gly lie 
15 10 15 

Cys Asp Ala Val Thr Gly Ser Arg Val Tyr Pro Ala Asn Glu Val Thr 
20 25 30 

Leu Leu Asp Ser Arg Ser Val Gin Gly Glu Leu Gly Trp lie Ala Ser 
35 40 45 

Pro Leu Glu Gly Gly Trp Glu Glu Val Ser lie Met Asp Glu Lys Asn 
50 55 60 

Thr Pro lie Arg Thr Tyr Gin Val Cys Asn Val Met Glu Pro Ser Gin 
65 70 75 80 

Asn Asn Trp Leu Arg Thr Asp Trp lie Thr Arg Glu Gly Ala Gin Arg 
85 90 95 

Val Tyr lie Glu lie Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu Pro 
100 105 110 

Gly Val Met Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr Glu 
115 120 125 

Ser Asp Asn Asp Lys Glu Arg Phe lie Arg Glu Asn Gin Phe Val Lys 
130 135 140 

lie Asp Thr lie Ala Ala Asp Glu Ser Phe Thr Gin Val Asp lie Gly 
145 150 155 160 
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Asp Arg lie Met Lys Leu Asn Thr Glu lie 
165 170 



Arg Asp 



Val 



Gly 



Pro Leu 
175 



Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gin Asp Val Gly Ala Cys lie 
180 185 190 

Ala Leu Val Ser Val Arg Val Phe Tyr Lys Lys Cys Pro Leu Thr Val 
195 200 2-05 

Arg Asn Leu Ala Gin Phe Pro Asp Thr lie Thr Gly. Ala Asp Thr Ser 
210 215 220 

Ser Leu Val Glu Val Arg Gly Ser Cys Val Asn Asn Ser Glu Glu Lys 
225 230 235 240 

Asp Val Pro Lys Met Tyr Cys Gly Ala Asp Gly Glu Trp Leu Val Pro 
245 250 255 

lie Gly Asn Cys Leu Cys Asn Ala Gly His Glu Glu Arg Ser Gly Glu 
260 265 270 

Cys Gin Ala Cys Lys lie Gly Tyr Tyr Lys Ala Leu Ser Thr Asp Ala 
275 280 285 

Thr Cys Ala Lys Cys Pro Pro His Ser Tyr Ser Val Trp Glu Gly Ala 
290 295 300 

Thr Ser Cys Thr Cys Asp Arg Gly Phe Phe Arg Ala Asp Asn Asp Ala 
305 310 315 320 

Ala Ser Met Pro Cys Thr Arg Pro Pro Ser Ala Pro Leu Asn Leu lie 
325 330 335 

Ser Asn Val Asn Glu Thr Ser Val Asn Leu' Glu Trp Ser Ser Pro Gin 
340 345 350 

Asn Thr Gly Gly Arg Gin Asp lie Ser Tyr Asn Val Val Cys Lys Lys 
355 360 365 

Cys Gly Ala Gly Asp Pro Ser Lys Cys Arg Pro Cys Gly Ser Gly Val 
370 375 380 

His Tyr Thr Pro Gin Gin Asn Gly Leu Lys Thr Thr Lys Val Ser lie 
385 390 395 400 

Thr Asp Leu Leu Ala His Thr Asn Tyr Thr Phe Glu lie Trp Ala Val 
405 410 415 

Asn Gly Val Ser Lys Tyr Asn Pro Asn Pro Asp Gin Ser Val Ser Val 
420 425 430 

Thr Val Thr Thr Asn Gin Ala Ala Pro Ser Ser lie Ala Leu Val Gin 
435 440 445 

Ala Lys Glu Val Thr Arg Tyr Ser Val Ala Leu Ala Trp Leu Glu Pro 
450 455 460 

Asp Arg Pro Asn Gly Val lie Leu Glu Tyr Glu Val Lys Tyr Tyr Glu 
465 470 475 480 

Lys Asp Gin Asn Glu Arg Ser Tyr Arg lie Val Arg Thr Ala Ala Arg 
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485 



490 



495 



Asn Thr Asp lie Lys Gly Leu Asn Pro Leu Thr Ser Tyr Val Phe His 
500 505 510 

Val Arg Ala Arg Thr Ala Ala Gly Tyr Gly Asp Phe Ser Glu Pro Leu 
515 520 525 

Glu Val Thr Thr Asn Thr Val Pro Ser Arg lie lie Gly Asp Gly Ala 
530 535 540 

Asn Ser Thr Val Leu Leu Val Ser Val Ser Gly Ser Val Val Leu Val 
545 550 555 560 



Val lie Leu lie Ala Ala Phe Val lie Ser Arg Arg Arg Ser Lys Tyr 
565 570 575 



Ser Lys Ala Lys Gin Glu Ala Asp Glu Glu Lys His Leu Asn Gin Gly 
580 585 590 

Val Arg Thr Tyr Val Asp Pro Phe Thr Tyr Glu Asp Pro Asn Gin Ala 
595 600 605 

Val Arg Glu Phe Ala Lys Glu lie Asp Ala Ser Cys lie Lys lie Glu 
610 615 620 

Lys Val lie Gly Val Gly Glu Phe Gly Glu Val Cys Ser Gly Arg Leu 
625 630 635 640 



Lys Val Pro Gly Lys Arg Glu lie Cys Val Ala lie Lys Thr Leu Lys 
645 650 655 



Ala Gly Tyr Thr Asp Lys Gin Arg 
660 

lie Met Gly Gin Phe Asp His Pro 
675 680 



Arg Asp Phe Leu Ser Glu Ala Ser 
665 670 

Asn lie lie His Leu Glu Gly Val 
685 



Val Thr Lys Cys Lys Pro Val Met lie He. Thr Glu Tyr Met Glu Asn 

690 695 700 

Gly Ser Leu Asp Ala Phe Leu Arg Lys Asn Asp Gly Arg Phe Thr Val 

705 710 715 720 



He Gin Leu Val Gly Met Leu Arg Gly He Gly Ser Gly Met Lys Tyr 

725 730 735 

Leu Ser Asp Met Ser Tyr Val His Arg Asp Leu Ala Ala Arg Asn He 

740 745 ' 750 



Leu Val Asn Ser Asn Leu Val Cys Lys Val Ser Asp Phe Gly Met Ser 
755 760 765 

Arg Val Leu Glu Asp Asp Pro Glu Ala Ala Tyr Thr Thr Arg Gly Gly 
770 775 780 

Lys He Pro He Arg Trp Thr Ala Pro Glu Ala He Ala Tyr Arg Lys 
785 790 795 800 

Phe Thr Ser Ala Ser Asp Val Trp Ser Tyr Gly He Val Met Trp Glu 
805 810 815 
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Val Met Ser Tyr 
820 

Pro Asn Thr Ala 
835 

Val Ser Val Gly 
850 

Asp Asn Phe Thr 
865 

Val Asn Gin Glu 



Gin Asn Lys lie 
900 

Gin Met His Gly 
915 



Gly Glu Arg Pro 



Leu Leu Asp Pro 
840 

Asp Trp Leu Gin 
855 

Ala Ala Gly Tyr 
870 

Asp Leu Ala Arg 
885 

Leu Ser Ser Val 



Arg Met Val Pro 
920 



Tyr Trp Asp Met 
825 

Ser Ser Pro Glu 



Ala lie Lys Met 
860 

Thr Thr Leu Glu 
875 

lie Gly lie Thr 
890 

Gin Ala Met Arg 
905 

Val 



Ser Asn Gin Asp 
830 

Phe Ser Ala Val 
845 

Asp Arg Tyr Lys 



Ala Val Val His 
880 

Ala lie Thr His 
895 

Thr Gin Met Gin 
910 



<210> 53 
<211> 444 
<212> PRT 

<213> Homo sapiens 
<400>' 53 

Met Asn Asp Phe Gly lie Lys Asn Met Asp Gin Val Ala Pro Val Ala 
15 10 15 

Asn Ser Tyr Arg Gly Thr Leu Lys Arg Gin Pro Ala Phe Asp Thr Phe 
20 25 30 

Asp Gly Ser Leu Phe Ala Val Phe Pro Ser Leu Asn Glu Glu Gin Thr 
35 40 45 

Leu Gin Glu Val Pro Thr Gly Leu Asp Ser lie Ser His Asp Ser Ala 
50 55 60 

Asn Cys Glu Leu Pro Leu Leu Thr Pro Cys Ser Lys Ala Val Met Ser 
65 70 75 80 

Gin Ala Leu Lys Ala Thr Phe Ser Gly Phe Phe Trp Ala Thr Asn Glu 
85 90 95 

Phe Ser Leu Val Asn Val Asn Leu Gin Arg Phe Gly Met Asn Gly Gin 
100 105 110 

Met Leu Cys Asn Leu Gly Lys Glu Arg Phe Leu Glu Leu Ala Pro Asp 
115 120 125 

Phe Val Gly Asp lie Leu Trp Glu His Leu Glu Gin Met lie Lys Glu 
130 135 140 

Asn Gin Glu Lys Thr Glu Asp Gin Tyr Glu Glu Asn Ser His Leu Thr 
145 150 155 160 

Ser Val Pro His Trp lie Asn Ser Asn Thr Leu Gly Phe Gly Thr Glu 
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165 



170 



175 



Gin Ala Pro Tyr Gly Met Gin Thr Gin Asn Tyr Pro Lys Gly Gly Leu 
180 185 190 

Leu Asp Ser Met Cys Pro Ala Ser Thr Pro Ser Val Leu Ser Ser Glu 
195 200 205 

Gin Glu Phe Gin Met Phe Pro Lys Ser Arg Leu Ser Ser Val Ser Val 
210 215 220 

Thr Tyr Cys Ser Val Ser Gin Asp Phe Pro Gly Ser Asn Leu Asn Leu 
225 230 235 240 

Leu Thr Asn Asn Ser Gly Thr Pro Lys Asp His Asp Ser Pro Glu Asn 
245 250 255 

Gly Ala Asp Ser Phe Glu Ser Ser Asp Ser Leu Leu Gin Ser Trp Asn 
260 265 270 

Ser Gin Ser Ser Leu Leu Asp Val Gin Arg Val Pro Ser Phe Glu Ser 
275 280 285 

Phe Glu Asp Asp Cys Ser Gin Ser Leu Cys Leu Asn Lys Pro Thr Met 
290 295 300 

Ser Phe Lys Asp Tyr lie Gin Glu Arg Ser Asp Pro Val Glu Gin Gly 
305 310 315 320 

Lys Pro Val lie Pro Ala Ala Val Leu Ala Gly Phe Thr Gly Ser Gly 
325 330 335 

Pro lie Gin Leu Trp Gin Phe Leu Leu Glu Leu Leu Ser Asp Lys Ser 
340 345 350 

Cys Gin Ser Phe lie Ser Trp Thr Gly Asp Gly Trp Glu Phe Lys Leu 
355 360 365 

Ala Asp Pro Asp Glu Val Ala Arg Arg Trp* Gly Lys Arg Lys Asn Lys 
370 375 380 

Pro Lys Met Asn Tyr Glu Lys Leu Ser Arg Gly Leu Arg Tyr Tyr Tyr 
385 390 395 400 

Asp Lys Asn lie lie His Lys Thr Ser Gly Lys Arg Tyr Val Tyr Arg 
405 410 415 

Phe Val Cys Asp Leu Gin Asn Leu Leu Gly Phe Thr Pro Glu Glu Leu 
420 425 430 

His Ala lie Leu Gly Val Gin Pro Asp Thr Glu Asp 
435 440 



<210> 54 
<211> 260 
<212> PRT 

<213> Homo sapiens 
<400> 54 
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Met Ala Giy Ser Ala Met Ser Ser Lys Phe Phe Leu Val Ala Leu Ala 
1 . 5 10 15 

lie Phe Phe Ser Phe Ala Gin Val Val lie Glu Ala Asn Ser Trp Trp 
20 25 30 

Ser Leu Gly Met Asn Asn Pro Val Gin Met Ser Glu Val Tyr Ile_ lie 
35 4 0 45 . 

Gly Ala Gin Pro Leu Cys Ser Gin Leu Ala Gly Leu Ser Gin. Gly Gin 
50 55 ■ 60 

Lys Lys Leu Cys His Leu Tyr Gin Asp His Met Gin Tyr lie Gly Glu 
65 70 75 80 

Gly Ala Lys Thr Gly lie Lys Glu Cys Gin Tyr Gin Phe Arg His Arg 
85 90 95 

Arg Trp Asn Cys Ser Thr Val Asp Asn Thr Ser Val Phe Gly Arg Val 
100 105 110 

Met Gin lie Gly Ser Arg Glu Thr Ala Phe Thr Tyr Ala Val Ser Ala 
115 120 125 

Ala Gly Val Val Asn Ala Met Ser Arg Ala Cys Arg Glu Gly Glu Leu 
130 135 140 

Ser Thr Cys Gly Cys Ser Arg Ala Ala Arg Pro Lys Asp Leu Pro Arg 
145 150 155 160 

Asp Trp Leu Trp Gly Gly Cys Gly Asp Asn lie Asp Tyr Gly Tyr Arg 
165 170 175 

Phe Ala Lys Glu Phe Val Asp Ala Arg Glu Arg Glu Arg lie His Ala 
180 185 190 

Lys Gly Ser Tyr Glu Ser Ala Arg lie Leu Met Asn Leu His Asn Asn 
195 200 205 

Glu Ala Gly Arg Arg Thr Val Tyr Asn Leu Ala Asp Val Ala Cys Lys 
210 215 220 

Cys His Gly Val Ser Gly Ser Cys Ser Leu Lys Thr Cys Trp Leu Gin 
225 230 235 240 

Leu Ala Asp Phe Arg Lys Val Gly Asp Ala Leu Lys Glu Lys Tyr Asp 
245 250 255 



Thr Leu Val Gly 
260 



<210> 55 
<211> 719 
<212> PRT 

<213> Homo sapiens 



<400> 55 

Met Ala Leu Arg Arg Ser Met Gly Arg Pro Gly Leu Pro Pro Leu Pro 
15 10 15 
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Leu Pro Pro Pro 
20 

Ala Gly Leu Lys 
35 

Gly Gin Pro Val 
50 

Asp lie Gin Trp 
65 

Leu Tyr lie Pro 



Lys Ser Val Glu 
100 

Asp Gly Gly Glu 
115 

Gly Val Pro Phe 
130 

Asn Ala Pro Phe 
145 

Val Thr He Val 



Pro Ser Pro Ser 
180 

Phe Ser Cys Glu 
195 

Ala Thr Val His 
210 

Val Thr Lys Leu 
225 

Ala Asp Gly Arg 



Ala Pro Gly Gly 
260 

Phe Thr Cys Leu 
275 

Arg Val Arg Cys 
290 

Val Pro Phe Gin 
305 

Leu His Ala He 



Pro Arg Leu Gly 



Leu Met Gly Ala 
40 

Lys Leu Asn Cys 
55 

Val Lys Asp Gly 
70 

Val Ser Glu Gin 
85 

Arg Ser Asp Ala 



Thr Glu He Ser 
120 

Phe Thr Val Glu 
135 

Gin Leu Ser Cys 
150 

Trp Trp Arg Gly 
165 

Val Leu Asn Val 



Ala His Asn Leu 
200 

Leu Gin Ala Leu 
215 

Ser Ser Ser Asn 
' 230 

Ala Leu Leu Gin 
245 

Trp Glu Val Leu 



Leu Arg Asp Leu 
280 

Ala Asn Ala Leu 
295 

Thr Lys Gly Leu 
310 

Arg Thr Asp Ser 
325 



Leu Leu Leu Ala 
25 

Pro Val Lys Leu 



Ser Val Glu Gly 
60 

Ala Val Val Gin 
75 

His Trp He Gly 
90 

Gly Arg Tyr Trp 
105 

Gin Pro Val Trp. 



Pro Lys Asp Leu 
140 

Glu Ala Val Gly 
. 155 

Thr Thr Lys He 
170 

Thr Gly Val Thr 
185 

Lys Gly Leu Ala 



Pro Ala Ala Pro 
220 

Ala Ser Val Ala 
235 

Ser Cys Thr Val 
250 

Ala Val Val Val 
265 

Val Pro Ala Thr 



Gly Pro Ser Pro 
300 

Ala Pro Ala Ser 
315 

Gly Leu He Leu 
330 



Glu Ser Ala Ala 
30 

Thr Val Ser Gin 
45 

Met Glu Glu Pro 



Asn Leu Asp Gin 
80 

Phe Leu Ser Leu 
95 

Cys Gin Val Glu 
110 

Leu Thr Val Glu 
125 

Ala Val Pro Pro 



Pro Pro Glu Pro 
160 

Gly Gly Pro Ala 
175 

Gin Ser Thr Met 
190 

Ser Ser Arg Thr 
205 

Phe Asn He Thr 



Trp Met Pro Gly 
240 

Gin Val Thr Gin 
255 

Pro Val Pro Pro 
27 0 

Asn Tyr Ser Leu 
285 

Tyr Ala Asp Trp 



Ala Pro Gin Asn 
320 

Glu Trp Glu Glu 
335 
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Val lie Pro Glu Ala Pro Leu Glu Gly Pro Leu Gly Pro Tyr Lys Leu 
340 345 350 

Ser Trp Val Gin Asp Asn Gly Thr Gin Asp Glu Leu Thr Val Glu Gly 
355 360 365 

Thr Arg Ala Asn Leu Thr Gly Trp Asp Pro Gin Lys Asp Leu lie Val 
370 375 380 

Arg Val Cys Val Ser Asn Ala Val Gly Cys Gly Pro Trp Ser Gin Pro 
385 390 395 400 

Leu Val Val Ser Ser His Asp Arg Ala Gly Gin Gin Gly Pro Pro His 
405 410 415 

Ser Arg Thr Ser Trp Val Pro Val Val Leu Gly Val Leu Thr Ala Leu 
420 425 430 

Val Thr Ala Ala Ala Leu Ala Leu lie Leu Leu Arg Lys Arg Arg Lys 
435 440 445 

G'lu Thr Arg Phe Gly Gin Ala Phe Asp Ser Val Met Ala Arg Gly Glu 
450 455 460 

Pro Ala Val His Phe Arg Ala Ala Arg Ser Phe Asn Arg Glu Arg Pro 
465 470 475 480 

Glu Arg lie Glu Ala Thr Leu Asp Ser Leu Gly lie Ser Asp Glu Leu 
485 490 495 

Lys Glu Lys Leu Glu Asp Val Leu lie Pro Glu Gin Gin Phe Thr Leu 
500 505 510 

Gly Arg Met Leu Gly Lys Gly Glu Phe Gly Ser Val Arg Glu Ala Gin 
515 520 525 

Leu Lys Gin Glu Asp Gly Ser Phe Val Lys Val Ala Val Lys Met Leu 
530 535 540 

Lys Ala Asp lie lie Ala Ser Ser Asp lie Glu- Glu Phe Leu Arg Glu 
545 550 555 560 

Ala Ala Cys Met Lys Glu Phe Asp His Pro His Val Ala Lys Leu Val 
565 570 575 

Gly Val Ser Leu Arg Ser Arg Ala Lys Gly Arg Leu Pro lie Pro Met 
580 585 590 

Val lie Leu Pro Phe Met Lys His Gly Asp Leu His Ala Phe Leu Leu 
595 600 605 

Ala Ser Arg lie Gly Glu Asn Pro Phe Asn Leu Pro Leu Gin Thr Leu 
610 615 620 

lie Arg Phe Met Val Asp lie Ala Cys Gly Met Glu Tyr Leu Ser Ser 
625 630 635 640 

Arg Asn Phe lie His Arg Asp Leu Ala Ala Arg Asn Cys Met Leu Ala 
645 650 655 

Glu Asp Met Thr Val Cys Val Ala Asp Phe Gly Leu Ser Arg Lys lie 



54 



660 



665 



670 



Tyr Ser Asp Cys Arg Tyr lie Leu Thr Pro Giy Gly Leu Ala Glu Gin 

675 680 685 

Pro Gly Gin Ala Glu His Gin Pro Glu Ser Pro Leu Asn Glu Thr Gin 

690 695 700 

Arg Leu Leu Leu Leu Gin Gin Gly Leu Leu Pro His Ser Ser Cys 

705 710 715 



<210> 56 

<211> 848 

<212> PRT 

<213> Homo sapiens 

<400> 56 

Met Cys Arg lie Ala Gly Ala Leu Arg Thr Leu Leu Pro Leu Leu Ala 

1 5 ; ' 10 15 

Ala Leu Leu Gin Ala Ser Val Glu Ala Ser Gly Glu lie Ala Leu Cys 
20 25 30 

Lys Thr Gly Phe Pro Glu Asp Val Tyr Ser Ala Val Leu Ser Lys Asp 
35 40 45 

Val His Glu Gly Gin Pro Leu Leu Asn Val Lys Phe Ser Asn Cys Asn 
50 55 60 

Gly Lys Arg Lys Val Gin Tyr Glu Ser Ser Glu Pro Ala Asp Phe Lys 
65 70 75 80 

Val Asp Glu Asp Gly Met Val Tyr Ala Val Arg Ser Phe Pro Leu Ser 
85 90 95 

Ser Glu His Ala Lys Phe Leu lie Tyr Ala Gin Asp Lys Glu Thr Gin 
100 105 110 

Glu Lys Trp Gin Val Ala Val Lys Leu Ser Leu Lys Pro Thr Leu Thr 
115 120 125 

Glu Glu Ser Val Lys Glu Ser Ala Glu Val Glu Glu lie Val Phe Pro 
130 135 140 

Arg Gin Phe Ser Lys His Ser Gly His Leu Gin Arg Gin Lys Arg Asp 
145 150 155 160 

Trp Val lie Pro Pro lie Asn Leu Pro Glu Asn Ser Arg Gly Pro Phe 
165 170 175 

Pro Gin Glu Leu Val Arg lie Arg Ser Asp Arg Asp Lys Asn Leu Ser 
180 185 190 

Leu Arg Tyr Ser Val Thr Gly Pro Gly Ala Asp Gin Pro Pro Thr Gly 
195 200 205 

lie Phe lie lie Asn Pro lie Ser Gly Gin Leu Ser Val Thr Lys Pro 
210 215 220 
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Leu Asp Arg Glu Gin lie Ala Arg Phe His Leu Arg Ala His Ala Val 
225 230 235 240 

Asp lie Asn Gly Asn Gin Val Glu Asn Pro lie Asp lie Val lie Asn 
245 250 255 

Val lie Asp Met Asn Asp_Asn Arg Pro Glu Phe Leu His Gin Val Trp 
260 265 270 

Asn Gly Thr Val Pro Glu Gly Ser Lys Pro Gly Thr Tyr Val Met Thr 
275 280 285 

-Val Thr Ala lie Asp Ala Asp Asp Pro Asn Ala Leu Asn Gly Met Leu 
290 295 300 

Arg Tyr Arg lie Val Ser Gin Ala Pro Ser Thr Pro Ser Pro Asn Met 
305 310 315 320 

Phe Thr lie Asn Asn Glu Thr Gly Asp lie lie Thr Val Ala Ala Gly 
325 330 335 

Leu Asp Arg Glu Lys Val Gin Gin Tyr Thr Leu lie lie Gin Ala Thr 
340 345 350 

Asp Met Glu Gly Asn Pro Thr Tyr ' Gly Leu Ser Asn Thr Ala Thr Ala 
355 . 360 365 

Val lie Thr Val Thr Asp Val Asn Asp Asn Pro Pro Glu Phe Thr Ala 
370 375 380 

Met Thr Phe Tyr Gly Glu Val Pro Glu Asn Arg Val Asp lie lie Val 
385 390 395 400 

Ala Asn Leu Thr Val Thr Asp Lys Asp Gin Pro His Thr Pro Ala Trp 
405 410 415 

Asn Ala Val Tyr Arg lie Ser Gly Gly Asp Pro Thr Gly Arg Phe Ala 
420 425 430 

lie Gin Thr Asp Pro Asn Ser Asn Asp Gly Leu Val Thr Val Val Lys 
435 440 445 

Pro lie Asp Phe Glu Thr Asn Arg Met Phe Val Leu Thr Val Ala Ala 
450 455 460 

Glu Asn Gin Val Pro Leu Ala Lys Gly lie Gin His Pro Pro Gin Ser 
465 470 475 480 

Thr Ala Thr Val Ser Val Thr Val lie Asp Val Asn Glu Asn Pro Tyr 
485 490 495 

Phe Ala Pro Asn Pro Lys lie lie Arg Gin Glu Glu Gly Leu His Ala 
500 505 510 

Gly Thr Met Leu Thr Thr Phe Thr Ala Gin Asp Pro Asp Arg Tyr Met 
515 520 525 

Gin Gin Asn lie Arg Tyr Thr Lys Leu Ser Asp Pro Ala Asn Trp Leu 
530 535 540 

Lys lie Asp Pro Val Asn Gly Gin lie Thr Thr lie Ala Val Leu Asp 
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545 



550 



555 



560 



Arg Glu Ser Pro Asn Val Lys Asn Asn lie Tyr Asn Ala Thr Phe Leu 
565 570 575 

Ala Ser Asp Asn Gly lie Pro Pro Met Ser Gly Thr Gly Thr Leu Gin 
580 585 590 



lie Tyr Leu Leu Asp lie Asn Asp Asn Ala Pro Gin Val Leu Pro Gin 
595 600 605 

Glu Ala Glu Thr Cys Glu Thr Pro Asp Pro Asn Ser lie Asn lie Thr 
610 615 620 

Ala Leu Asp Tyr Asp lie Asp Pro Asn Ala Gly Pro Phe Ala Phe Asp 

625 630 635 640 



Leu Pro Leu Ser Pro Val Thr lie 
645 

Leu Asn Gly Asp Phe Ala Gin Leu 
660 

Ala Gly lie Tyr Glu Val Pro lie 
675 680 



Lys Arg Asn Trp Thr lie Thr Arg 
650 655 

Asn Leu Lys lie Lys Phe Leu Glu 
665 670 

lie lie Thr Asp Ser Gly Asn Pro 
685 



Pro Lys Ser Asn 
690 

Ser Asn Gly Asp 
705 



lie Ser lie Leu 
695 

Cys Thr Asp Val 
710 



Arg Val Lys Val 
700 

Asp Arg lie Val 
715 



Cys Gin Cys Asp 

Gly Ala Gly Leu 
720 



Gly Thr Gly Ala 

lie Leu Val Leu 
740 



lie lie Ala lie 
725 

Met Phe Val Val 



Leu Leu Cys lie 
730 

Trp Met Lys Arg 
745 



lie lie Leu Leu 
735 

Arg Asp Lys Glu 
750 



Arg Gin Ala Lys Gin 
755 

Asn lie Leu Lys Tyr 
770 

Tyr Asp Leu Ser Gin 
785 

lie Lys Pro Val Gly 
805 

Glu Pro Gin Tyr Pro 
820 

Gly Asp Phe lie Asn 
835 



Leu Leu lie Asp Pro 
760 

Asp Glu Glu Gly Gly 
775 

Leu Gin Gin Pro Asp 
790 

lie Arg Arg Met Asp 
810 

Val Arg Ser Ala Ala 
825 

Glu Lys Thr Trp Pro 
840 



Glu Asp Asp Val Arg Asp 
765 

Gly Glu Glu Asp Gin Asp 
780 

Thr Val Glu Pro Asp Ala 
795 800 

Glu Arg Pro lie His Ala 
815 

Pro His Pro Gly Asp lie 
830 

lie Gin Ser Leu His Leu 
845 
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<210> 57 
<211> 103 
<212> PRT 

<213> Homo sapiens 



<400> 57 
Met Glu Arg Val 
1 

Glu Phe Leu Glu 
20 

Ser Phe Pro Ser 
35 

Arg Arg Leu Ser 
50 



Lys Met lie Asn 
5 

Arg Arg Glu Arg 

Met Pro Ser Pro 
40 

Arg Ala Gin Lys 
55 



Val Gin Arg Leu 
10 

Glu Cys Glu His 
25 

Arg Leu Gin His 



His Ser Ser Gly 
60 



Leu Glu Ala Ala 
15 

Gly Tyr Ala Ser 
30 

Ser Lys Pro Pro 
45 

Ser Ser Asn Thr 



Ser Thr Ala Asn Arg Ser Thr His Asn Glu Leu Glu Lys Asn Arg Leu 
65 7:0 75 80 

Lys Asn Trp Leu Val Gly Arg Arg Asp Thr Arg Gly Met Lys Met Leu 
85 90 95 

Leu Lys Ala He Ala Val He 
100 



<210> 58 
<211> 234 
<212> PRT 

<213> Homo sapiens 



<400> 58 

Met Glu Lys His He Asn Thr Phe Leu Gin Asn Val Gin He Leu Leu 
15 10 15 



Glu Ala Ala Ser Tyr Leu Glu Gin He Glu ' Lys Glu Asn Lys Lys Cys 
20 25 30 

Glu His Gly Tyr Ala Ser Ser Phe Pro Ser Met Pro" Ser Pro Arg Leu 
35 40 45 

Gin His Ser Lys Pro Pro Arg Arg Leu Ser Arg Ala Gin Lys His Ser 
50 55 60 



Ser Gly Ser Ser Asn Thr Ser Thr Ala Asn Arg Ser Thr His Asn Glu 

65 70 75 80 

Leu Glu Lys Asn Arg Arg Ala His Leu Arg Leu Cys Leu Glu Arg Leu 

85 90 95 



Lys Val Leu He Pro Leu Gly Pro Asp Cys Thr Arg His Thr Thr Leu 
100 105 110 

Gly Leu Leu Asn Lys Ala Lys Ala His He Lys Lys Leu Glu Glu Ala 
115 120 125 

Glu Arg Lys Ser Gin His Gin Leu Glu Asn Leu Glu Arg Glu Gin Arg 
130 135 140 
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Phe Leu Lys Trp 
145 

Arg lie Arg Met 



Asp Ser Glu Arg 
180 

Ser His Gly Glu 
195 

Asp Asp His Ser 
210 

Ser Ala Ser Val 
225 



Arg Leu Glu Gin 
150 

Asp Ser lie Gly 
165 

Glu Glu lie Glu 



Val Asp Asn lie 
200 



Ser Leu Pro Ser 
215 

Lys Leu Ser Phe 
230 



Leu Gin Gly Pro 
155 

Ser Thr lie Ser 
170 

Val Asp Val Glu 
185 

Ser thr Thr Ser 



lie Gly Ser Asp 
220 

Thr Ser 



Gin Glu Met Glu 
160 

Ser Asp Arg Ser 
175 

Ser Thr Glu Phe 
190 

lie Ser Asp lie 
205 

Glu Gly Tyr Ser 



<210> 59 
<211> 329 
<212> PRT 

<213> Homo sapiens 
<400> 59 

Met Glu Ser Pro Ala Ser Ser Gin Pro Ala Ser Met Pro Gin Ser Lys 
15 10 15 

Gly Lys Ser Lys Arg Lys Lys Asp Leu Arg lie Ser Cys Met Ser Lys 

20 . 25 30 

Pro Pro Ala Pro Asn Pro Thr Pro Pro Arg Asn Leu Asp Ser Arg Thr 
35 40 45 

Phe lie Thr lie Gly Asp Arg Asn Phe Glu Val Glu Ala Asp Asp Leu 
50 55 60 

Val Thr lie Ser Glu Leu Gly Arg Gly Ala Tyr Gly Val Val Glu Lys 
65 70 75 80 

Val Arg His Ala Gin Ser Gly Thr lie Met Ala Val Lys Arg lie Arg 
85 90 95 

Ala Thr Val Asn Ser Gin Glu Gin Lys Arg Leu Leu Met Asp Leu Asp 
100 105 110 

rie Asn Met Arg Thr Val Asp Cys Phe Tyr Thr Val Thr Phe Tyr Gly 
115 120 125 

Ala Leu Phe Arg Glu Gly Asp Val Trp lie Cys Met Glu Leu Met Asp 
130 135 140 

Thr Ser Leu Asp Lys Phe Tyr Arg Lys Val Leu Asp Lys Asn Met Thr 
145 150 155 160 

lie Pro Glu Asp lie Leu Gly Glu lie Ala Val Ser lie Val Arg Ala 
165 170 175 

Leu Glu His Leu His Ser Lys Leu Ser Val lie His Arg Asp Val Lys 
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180 



185 



190 



Pro Ser Asn Val 
195 

Phe Gly lie Ser 
210 

Ala Gly Cys Lys 
225 

Asn Gin Lys Gly 



Thr Met lie Glu 
260 

Thr Pro Phe Gin 
275 

Leu Pro Ala Asp 
290 

Cys Leu Arg Lys 
305 

Gly Ala Asp Arg 



Leu lie Asn Lys 
200 

Gly Tyr Leu Val 
215 

Pro Tyr Met Ala 
230 

Tyr Asn Val Lys 
245 

Met Ala lie Leu 



Gin Leu Lys Gin 
280 

Arg Phe Ser Pro 
295 

Asn Pro Ala Glu 
310 

Phe Ser Pro Thr 
325 



Glu Gly His Val 



Asp Ser Val Ala 
220 

Pro Glu Arg lie 
235 

Ser Asp Val Trp 
250 

Arg Phe Pro Tyr 
265 

Val Val Glu Glu 



Glu Phe Val Asp 
300 

Arg Met Ser Tyr 
315 

Pro 



Lys Met Cys Asp 
205 

Lys Thr Met Asp 



Asn Pro Glu Leu 
240 

Ser Leu Gly lie 
255 

Glu Ser Trp Gly 
270 

Pro Ser Pro Gin 
285 

Phe Thr Ala Gin 



Leu Glu Leu lie 
320 



<210> 60 
<211> 292 
<212> PRT 

<213> Homo sapiens 
<400> 60 

Met Pro Glu lie Arg Leu Arg His Val Val Ser Cys Ser Ser Gin Asp 
1 5 10. 15 

Ser Thr His Cys Ala Glu Asn Leu Leu Lys Ala Asp Thr Tyr Arg Lys 
20 25 30 

Trp Arg Ala Ala Lys Ala Gly Glu Lys Thr lie Ser Val Val Leu Gin 
35 40 45 

Leu Glu Lys Glu Glu Gin lie His Ser Val Asp lie Gly Asn Asp Gly 
50 55 60 

Ser Ala Phe Val Glu Val Leu Val Gly Ser Ser Ala Gly Gly Ala Gly 
65 70 75 80 

Glu Gin Asp Tyr Glu Val Leu Leu Val Thr Ser Ser Phe Met Ser Pro 
85 90 95 

Ser Glu Ser Arg Ser Gly Ser Asn Pro Asn Arg Val Arg Met Phe Gly 
100 105 110 

Pro Asp Lys Leu Val Arg Ala Ala Ala Glu Lys Arg Trp Asp Arg Val 
115 120 125 
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Lys lie Val Cys 
130 

Ser Phe Val Arg 
145 

Pro Ser Gin Lys 



Glu Glu Asp Glu 
180 

Ser Arg lie Asn 
195 

Pro Ser Tyr Ala 
210 

Ala Ser Pro Val 
225 

Ser Pro Trp His 



Ser Arg Ser Tyr 

260 

Glu Val Cys Ala 
275 

Thr Arg Cys lie 
290 



Ser Gin Pro Tyr 
135 

Phe His Ser Pro 
150 

Val Thr Val Thr 
165 

Ser Ala Asn Ser 



Lys Thr Ser Pro 
200 

Ala Ala Thr Leu 
215 

Ser Arg Ala lie 
230 

Ser Phe Val Pro 
245 

Phe Leu Thr Ser 



lie His Thr His 
280 



Ser Lys Asp Ser 
140 

Pro Asp Lys Asp 
155 

Lys Leu Gly Gin 
170 

Leu Arg Pro Gly 
185 ■ 

Val Thr Ala Ser 



Gin Ala Ser Ser 
220 

Gly Ser Thr Ser 

235; 

Asp Gly Ser Thr 
250 

Ser Met Gly Trp 
265 

Thr His Thr His 



Pro Phe Gly Leu 



Glu Ala Glu Ala 
160 

Phe Arg Val Lys 
175 

Ala Leu Phe Phe 
190 

Asp Pro Ala Gly 
205 

Ala Ala Ser Ser 



Lys Pro Gin Glu 
240 

Val Ala Met Arg 
255 

Cys Arg Lys Pro 
270 

Thr His Thr His 
285 



<210> 61 
<211> 266 
<212> PRT 

<213> Homo sapiens 
<400> 61 

Met Pro Glu lie Arg Leu Arg His Val Val Ser Cys Ser Ser Gin Asp 
15 10 15 

Ser Thr His Cys Ala Glu Asn Leu Leu Lys Ala Asp Thr Tyr Arg Lys 
20 25 30 

Trp Arg Ala Ala Lys Ala Gly Glu Lys Thr lie Ser Val Val Leu Gin 
35 40 45 

Leu Glu Lys Glu Glu Gin lie His Ser Val Asp lie Gly Asn Asp Gly 
50 55 60 

Ser Ala Phe Val Glu Val Leu Val Gly Ser Ser Ala Gly Gly Ala Gly 
65 70 75 80 

Glu Gin Asp Tyr Glu Val Leu Leu Val Thr Ser Ser Phe Met Ser Pro 
85 90 95 

Ser Glu Ser Arg Ser Gly Ser Asn Pro Asn Arg Val Arg Met Phe Gly 
100 105 110 
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Pro Asp Lys Leu 
115 

Lys lie Val Cys 
130 

Ser Phe Val Arg 
145 

Pro Ser Gin Lys 



Glu Glu Asp Glu 
180 

Ser Arg lie Asn 
195 

Pro Ser Tyr Ala 
210 

Ala Ser Pro Val 
225 

Ser Ser Asp Phe 



Ser lie Pro lie 
260 



Val Arg Ala Ala 
120 

Ser Gin Pro Tyr 
135 

Phe His Ser Pro 
150 

Val Thr Val Thr 
165 

Ser Ala Asn Ser 



Lys Thr Ser Pro 
200 

Ala Ala Thr Leu 
215 

Ser Arg Ala lie 
230 

Gly Gly Val Glu 
245 

Pro Ser Ala Pro 



Ala Glu Lys Arg 



Ser Lys Asp Ser 
140 

Pro Asp Lys Asp 
155 

Lys Leu Gly Gin 
170 

Leu Arg Pro Gly 
185 

Val Thr Ala Ser 



Gin Ala Ser Ser 
220 

Gly Ser Thr Ser 
235 

Glu Glu Arg Ser 
250 

Gly Ser 
265 



Trp Asp Arg Val 
125 

Pro Phe Gly Leu 



Glu Ala Glu Ala 
160 

Phe Arg Val Lys 
175 

Ala Leu Phe Phe 
190 

Asp Pro Ala Gly 
205 

Ala Ala Ser Ser 



Lys Pro Gin Glu 
240 

Trp Arg Pro Gin 
255 



<210> 62 
<211> 247 
<212> PRT 

<213> Homo sapiens 
<400> 62 

Met Pro Glu lie Arg Leu Arg His Val Val Ser Cys Ser Ser Gin Asp 
15 10 15 

Ser Thr His Cys Ala Glu Asn Leu Leu Lys Ala Asp Thr Tyr Arg Lys 
20 25 30 

Trp Arg Ala Ala Lys Ala Gly Glu Lys Thr lie Ser Val Val Leu Gin 
35 40 45 

Leu Glu Lys Glu Glu Gin lie His Ser Val Asp lie Gly Asn Asp Gly 
50 55 60 

Ser Ala Phe Val Glu Val Leu Val Gly Ser Ser Ala Gly Gly Ala Gly 
65 70 75 80 

Glu Gin Asp Tyr Glu Val Leu Leu Val Thr Ser Ser Phe Met Ser Pro 
85 90 95 

Ser Glu Ser Arg Ser Gly Ser Asn Pro Asn Arg Val Arg Met Phe Gly 
100 105 110 

Pro Asp Lys Leu Val Arg Ala Ala Ala Glu Lys Arg Trp Asp Arg Val 
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- 115 

Lys lie Val Cys 
130 

■Ser Phe Val Arg 
145 

Pro Ser Gin Lys 



Glu Glu Asp Glu 
180 

Asp Arg Val Gin 
195 

Glu Glu Ala Leu 
210 

Trp lie Tyr Ser 
225 

Tyr Gly Val Val 



120 

Ser Gin Pro Tyr 
135 

Phe His Ser Pro 
150 

Val Thr Val Thr 
165 

Ser Ala Asn Ser 



Phe Val lie Thr 
200 

Met Asp Asn Pro 
215 

Cys Asn Glu Lys 
230 

Pro Gin Ala 
245 



Ser Lys Asp Ser 
140 

Pro Asp Lys Asp 
155 

Lys Leu Gly Gin 
170 

Leu Arg Leu Glu 
185 

Ala Gin Glu Trp 



Ser Leu Ala Phe 
220 

Gin Lys Leu Leu 
235 



12 5 

Pro Phe Gly Leu 



Glu Ala Glu Ala 
160 

Phe Arg Val Lys 
175 

Asp Tyr Met Ser 
190 

Asp Pro Ser Phe 
205 

Val Arg Pro Arg 



Pro His Gin Leu 
240 



<210> 63 
<211> 624 
<212> PRT 

<213> Homo sapiens 
<400> 63 

Met Pro Glu lie Arg Leu Arg His Val Val Ser Cys Ser Ser Gin Asp 
15 10 15 

Ser Thr His Cys Ala Glu Asn Leu Leu Lys Ala Asp Thr Tyr Arg Lys 
20 25 30 

Trp Arg Ala Ala Lys Ala Gly Glu Lys Thr lie Ser Val Val Leu Gin 
35 40 45 

Leu Glu Lys Glu Glu Gin lie His Ser Val Asp lie Gly Asn Asp Gly 
50 55 60 

Ser Ala Phe Val Glu Val Leu Val Gly Ser Ser Ala Gly Gly Ala Gly 
65 70 75 80 

Glu Gin Asp Tyr Glu Val Leu Leu Val Thr Ser Ser Phe Met Ser Pro 
85 90 95 

Ser Glu Ser Arg Ser Gly Ser Asn Pro Asn Arg Val Arg Met Phe Gly 
100 105 110 

Pro Asp Lys Leu Val Arg Ala Ala Ala Glu Lys Arg Trp Asp Arg Val 
115 120 125 

Lys lie Val Cys Ser Gin Pro Tyr Ser Lys Asp Ser Pro Phe Gly Leu 
130 135 140 
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Ser Phe Val Arg 
145 

Pro Ser Gin Lys 



Glu Glu Asp Glu 
180 

Ser Arg lie Asn 
195 

Pro Ser Tyr Ala 
210 

Ala Ser Pro Val 
225 

Ser Pro Lys Gly 



Thr Pro Ser Lys 
260 

Pro Lys Leu Pro 
275 

Ala Arg Ala Gin 
290 

Glu Pro Arg Arg 
305 

Gin Gly Val Val 



Glu Leu Arg Asp 
340 

Trp Thr Arg Asp 
355 

Lys Tyr Ser Gin 
370 

Trp Val Leu Asp 
385 

Tyr Leu Met Ala 



His Ser Gly Gly 
420 

Pro Gin Thr Lys 
435 

Gin Lys Pro Pro 
450 

Gin Glu Asp lie 



Phe His Ser Pro 
150 

Val Thr Val Thr 
165 

Ser Ala Asn Ser 



Lys Thr Ser Pro 
200 

Ala Ala . Thr Leu 
215 

Ser Arg Ala lie 
230 

Lys Arg Lys Leu 
245 

Pro Pro Ala Gin 



Ala Pro Thr Arg 
280 

Gly Ala Val Thr 
295 

Pro Arg Ala Gly 
310 

Val Val Leu Ser 
325 

Lys Ala Leu Glu 



Ser Thr His Leu 
360 

Val Leu Gly Leu 
375 

Cys His Arg Met 
390 

Gly Pro Gly Ser 
405 

Ser Gly Asp Glu 



Thr Lys Pro Thr 
440 

Thr Pro Glu Glu 
455 

Asp lie Glu Gly 



Pro Asp Lys Asp 
155 

Lys Leu Gly Gin 
170 

Leu Arg Pro Gly 
185 

Val Thr Ala Ser 



Gin Ala Ser Ser 
220 

Gly Ser Thr Ser 
235 

Asp Leu Asn Gin 
250 

Leu Ser Pro Ser 
265 

Thr Pro Ala Thr 



Gly Lys Pro Arg 
300 

Pro Glu Glu Leu 
315 

Gly Phe Gin Asn 
330 

Leu Gly Ala Lys 
345 

lie Cys Ala Phe 



Gly Gly Arg lie 
380 

Arg Arg Arg Leu 
395 

Ser Ser Glu Glu 
410 

Ala Pro Lys Leu 
425 

Gin Ala Ala Gly 



Thr Lys Ala Ala 
460 

Val Gin Ser Glu 



Glu Ala Glu Ala 
160 

Phe Arg Val Lys 
175 

Ala Leu Phe Phe 
190 

Asp Pro Ala Gly 
205 

Ala Ala Ser Ser 



Lys Pro Gin Glu 
240 

Glu Glu Lys Lys 
255 

Val Pro Lys Arg 
270 

Ala Pro Val Pro 
285 

Gly Glu Gly Thr 



Gly Lys lie Leu 
320 

Pro Phe Arg Ser 
335 

Tyr Arg Pro Asp 
350 

Ala Asn Thr Pro 
365 

Val Arg Lys Glu 



Pro Ser Arg Arg 
400 

Asp Glu Ala Ser 
415 

Pro Gin Lys Gin 
430 

Pro Ser Ser Pro 
445 

Ser Pro Val Leu 



Gly Gin Asp Asn 
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465 

Gly Ala Glu Asp 



Glu Gin Lys Glu 
500 

Asp Pro Tyr Ala 
515 

Gin Glu Pro Pro 
530 

Gly Gin Ala Leu 
545 

Glu Thr His Pro 



Ser Phe Val Pro 
580 

Phe Leu Thr Ser 
595 

lie His Thr His 
610 



470 

Ser Gly Asp Thr 
485 

His Arg Leu Pro 



Gly Ser Thr Asp 
520 

Asp Leu Pro Val 
535 

Leu Ser Leu Arg 
550 

lie Arg His Ser 
565 

Asp Gly Ser Thr 



Ser Met Gly Trp 
600 

Thr His Thr His 
615 



475 

Glu Asp Glu Leu 
490 

Pro Gly Gin Glu 
505 

Glu Asn Thr Asp 



Pro Glu Leu Pro 
540 

Gly Val Pro Trp 
555 

Leu Gin Trp Gly 
570 

Val Ala Met Arg 
585 

Cys Arg Lys Pro 



Thr His Thr His 
620 



480. 

Arg Arg Val Ala 
495 

Glu Asn Gly Glu 
510 

Ser Glu Glu His 
525 

Arg Phe Leu Pro 



Gly Arg Ala Ala 
560 

Ala Pro Trp His 
575 

Ser Arg Ser Tyr 
590 

Glu Val Cys Ala 
605 

Thr Arg Cys lie 



<210> 64 
<211> 567 
<212> PRT 

<213> Homo sapiens 
<400> 64 

Met Ala Gly Ala lie Ala Ser Arg Met Ser Phe Ser Ser Leu Lys Arg 
15 10 15 

Lys Gin Pro Lys Thr Phe Thr Val Arg lie Val Thr Met Asp Ala Glu 
20 25 30 

Met Glu Phe Asn Cys Glu Met Lys Trp Lys Gly Lys Asp Leu Phe Asp 
35 40 45 

Leu Val Cys Arg Thr Leu Gly Leu Arg Glu Thr Trp Phe Phe Gly Leu 
50 55 60 

Gin Tyr Thr lie Lys Asp Thr Val Ala Trp Leu Lys Met Asp Lys Lys 
65 70 75 80 

Val Leu Asp His Asp Val Ser Lys Glu Glu Pro Val Thr Phe His Phe 
85 90 95 

Leu Ala Lys Phe Tyr Pro Glu Asn Ala Glu Glu Glu Leu Val Gin Glu 
100 105 110 



65 



lie Thr Gin His 
115 

Glu Lys lie Tyr 
130 

Ala Val Gin Ala 
145 

Gly Phe Leu Ala 



Tyr Gin Met Thr 
180 

Ala Glu His Arg 
195 

Lys lie Ala Gin 
210 

Arg Asn Lys Lys 
225 

Leu His lie Tyr 



Pro Trp Lys Asn 
260 

Thr lie Lys Pro 
275 

Ser Lys Leu Arg 
290 

His Asp Leu Phe 
305 

Gin Met Lys Ala 



Glu Glu Ala Thr 
340 

Ala Asp Leu Leu 
355 

Leu Leu Ala Gin 
370 

Lys Ala Thr Ala 
385 

Lys Val Leu Glu 



Glu Arg Arg Ala 
420 

Ala Arg Glu Ala 



Leu Phe Phe Leu 
120 

Cys Pro Pro Glu 

- 135 

Lys Tyr Gly Asp 
150 

Gin Glu Glu Leu 
165 

Pro Glu Met Trp 



Gly Arg Ala Arg 
200 

Asp Leu Glu Met 
215 

Gly Thr Glu Leu 
230 

Asp Pro Glu Asn 
245 

Glu lie Arg Asn 



Leu Asp Lys Lys 
280 

Val Asn Lys Leu 
295 

Met Arg Arg Arg 
310 

Gin Ala Arg Glu 
325 

Met Ala Asn Glu 



Ala Glu Lys Ala 
360 

Lys Ala Ala Glu 
375 

lie Arg Thr Glu 
390 

Ala Glu Val Leu 
405 

Lys Glu Ala Asp 



Glu Arg Arg Ala 



Gin Val Lys Lys 



Ala Ser Val Leu 
140 

Tyr Asp Pro Ser 
155 

Leu Pro Lys Arg 
170 

Glu Glu Arg lie 
185 

Asp Glu Ala Glu 



Tyr Gly Val Asn 
220 

Leu Leu Gly Val 
235 

Arg Leu Thr Pro 
250 

lie Ser Tyr Ser 
265 

lie Asp Val Phe 



lie Leu Gin Leu 
300 

Lys Ala Asp Ser 
315 

Glu Lys Ala Arg 
330 

Ala Leu Met Arg 
345 

Gin lie Thr Glu 



Ala Glu Gin Glu 
380 

Glu Glu Lys Arg 
395 

Ala Leu Lys Met 
410 

Gin Leu Lys Gin 
425 

Lys Gin Lys Leu 



Gin lie Leu Asp 
125 

Leu Ala Ser Tyr 



Val His Lys Arg 
160 

Val lie Asn Leu 
175 

Thr Ala Trp Tyr 
190 

Met Glu Tyr Leu 
205 

Tyr Phe Ala lie 



Asp Ala Leu Gly 
240 

Lys lie Ser Phe 
255 

Asp Lys Glu Phe 
270 

Lys Phe Asn Ser 
285 

Cys lie Gly Asn 



Leu Glu Val Gin 
320 

Lys Gin Met Lys 
335 

Ser Glu Glu Thr 
350 

Glu Glu Ala Lys 
365 

Met Gin Arg lie 



Leu Met Glu Gin 
400 

Ala Glu Glu Ser 
415 

Asp Leu Gin Glu 
430 

Leu Glu lie Ala 
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435 



440 



Thr Lys Pro Thr Tyr 
450 

Pro Asp lie Pro Ser 
465 

Phe Lys Asp Thr Asp 
485 

Lys Val Glu Tyr Met 
500 

Glu Leu Lys Thr Glu 
515 

Ala Leu Asp lie Leu 
530 

Lys His Asn Thr lie 
545 

Val Ala Phe Phe Glu 
565 



Pro Pro Met Asn Pro lie 
455 

Phe Asn Leu lie Gly Asp 
470 475 

Met Lys Arg Leu Ser Met 
490 

Glu Lys Ser Lys His Leu 
505 

lie Glu Ala Leu Lys Leu 
520 

His Asn Glu Asn Ser Asp 
535 

Lys Lys Leu Thr Leu Gin 
550 555 

Glu Leu 



Pro Ala Pro Leu Pro 
460 

Ser Leu Ser Phe Asp 
480 

Glu lie Glu Lys Glu 
495 

Gin Glu Gin Leu Asn 
510 

Lys Glu Arg Glu Thr 
525 

Arg Gly Gly Ser Ser 
540 

Ser Ala Lys Ser Arg 
560 



<210> 65 
<211> 134 
<212> PRT 

<213> Homo sapiens 
<400> 65 

Met Arg Glu Arg Phe Asp Arg Phe Leu His Glu Lys Asn Cys Met Thr 
1 5 10 15 

Asp Leu Leu Ala Lys Leu Glu Ala Lys Thr Gly Val Asn Arg Ser Phe 
20 25 30 

lie Ala Leu Gly Val lie Gly Leu Val Ala Leu Tyr Leu Val Phe Gly 
35 40 . 45 

Tyr Gly Ala Ser Leu Leu Cys Asn Leu lie Gly Phe Gly Tyr Pro Ala 
50 55 60 

Tyr lie Ser lie Lys Ala lie Glu Ser Pro Asn Lys Glu Asp Asp Thr 
65 70 75 80 

Gin Trp Leu Thr Tyr Trp Val Val Tyr Gly Val Phe Ser lie Ala Glu 
85 90 95 

Phe Phe Ser Asp lie Phe Leu Ser Trp Phe Pro Phe Tyr Tyr Met Leu 
100 105 110 

Lys Gin lie Tyr Leu Glu Pro Pro Cys Ala Arg Phe Cys Ser Thr Ser 
115 120 125 

Gly Arg Tyr Phe Gly Glu 
130 



67 



<210> 66 
<211> 1278 
<212> PRT 

<213> Homo sapiens 
<400> 66 

Met Asp Leu Glu Gly Asp Arg Asn Gly Gly Ala Lys Lys Lys Asn Phe 
1 5 10 15 

Phe Lys Leu Asn Asn Lys Ser Glu Lys Asp Lys Lys Glu Lys Lys Pro 
20 25 30 

Thr Val Ser Val Phe Ser Met Phe Arg Tyr Ser Asn Trp Leu Asp Lys 
35 40 45 

Leu Tyr Met Val Val Gly Thr Leu Ala Ala lie lie His Gly Ala Gly 
50 55 60 

Leu Pro Leu Met Met Leu Val Phe Gly Glu Met Thr Asp lie Phe Ala 
65 70 75 80 

Asn Ala Gly Asn Leu Glu Asp Leu Met Ser Asn lie Thr Asn Arg Ser 
85 90 95 

Asp lie Asn Asp Thr Gly Phe Phe Met Asn Leu Glu Glu Asp Met Thr 
100 105 110 

Arg Tyr Ala Tyr Tyr Tyr Ser Gly lie Gly Ala Gly Val Leu Val Ala 
115 120 125 

Ala Tyr lie Gin Val Ser Phe Trp Cys Leu Ala Ala Gly Arg Gin lie 
130 135 140 

His Lys lie Arg Lys Gin Phe Phe His Ala lie Met Arg Gin Glu lie 
145 150 155 160 

Gly Trp Phe Asp Val His Asp Val Gly Glu' Leu Asn Thr Arg Leu Thr 
165 170 175 . 

Asp Asp Val Ser Lys lie Asn Glu Val lie Gly Asp Lys lie Gly Met 
180 185 190 

Phe Phe Gin Ser Met Ala Thr Phe Phe Thr Gly Phe lie Val Gly Phe 
195 200 205 

Thr Arg Gly Trp Lys Leu Thr Leu Val lie Leu Ala lie Ser Pro Val 
210 215 220 

Leu Gly Leu Ser Ala Ala Val Trp Ala Lys lie Leu Ser Ser Phe Thr 
225 230 235 240 

Asp Lys Glu Leu Leu Ala Tyr Ala Lys Ala Gly Ala Val Ala Glu Glu 
245 250 255 

Val Leu Ala Ala He Arg Thr Val He Ala Phe Gly Gly Gin Lys Lys 
260 265 270 

Glu Leu Glu Arg Tyr Asn Lys Asn Leu Glu Glu Ala Lys Arg lie Gly 
275 280 285 
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lie Lys Lys Ala lie Thr Ala Asn lie Ser lie Gly Ala Ala Phe Leu 
290 295 300 

Leu lie Tyr Ala Ser Tyr Ala Leu Ala Phe Trp Tyr Gly Thr Thr Leu 
305 310 315 320 

Val Leu Ser Gly Glu Tyr Servile Gly Gin Val Leu Thr Val Phe Phe 
325 330 335 

Ser Val Leu lie Gly Ala Phe Ser Val Gly Gin Ala Ser Pro Ser lie 
340 345 350 

Glu Ala Phe Ala Asn Ala Arg Gly Ala Ala Tyr Glu lie Phe Lys lie 
355 360 365 

lie Asp Asn Lys Pro Ser lie Asp Ser Tyr Ser Lys Ser Gly His Lys 
370 375 380 

Pro Asp Asn lie Lys Gly Asn Leu Glu Phe Arg Asn Val His Phe Ser 
385 390 395 400 

Tyr Pro Ser Arg Lys Glu Val Lys lie Leu Lys Gly Leu Asn Leu Lys 
405 410 415 

Val Gin Ser Gly Gin Thr Val Ala Leu Val Gly Asn Ser Gly Cys Gly 
420 425 430 

Lys Ser Thr Thr Val Gin Leu Met Gin Arg Leu Tyr Asp Pro Thr Glu 
435 440 445 

Gly Met Val Ser Val Asp Gly Gin Asp lie Arg Thr lie Asn Val Arg 
450 455 460 

Phe Leu Arg Glu lie lie Gly Val Val Ser Gin Glu Pro Val Leu Phe 
465 470 475 480 

Ala Thr Thr lie Ala Glu Asn lie Arg Tyr Gly Arg Glu Asn Val Thr 
485 490- 495 

Met Asp Glu lie Glu Lys Ala Val Lys Glu Ala Asn Ala Tyr Asp Phe 
500 505 510 

lie Met Lys Leu Pro His Lys Phe Asp Thr Leu Val Gly Glu Arg Gly 
515 520 525 

Ala Gin Leu Ser Gly Gly Gin Lys Gin Arg lie Ala lie Ala Arg Ala 
530 535 540 

Leu Val Arg Asn Pro Lys lie Leu Leu Leu Asp Glu Ala Thr Ser Ala 
545 550 555 560 

Leu Asp Thr Glu Ser Glu Ala Val Val Gin Val Ala Leu Asp Lys Ala 
565 570 575 

Arg Lys Gly Arg Thr Thr lie Val lie Ala His Arg Leu Ser Thr Val 
580 585 590 

Arg Asn Ala Asp Val lie Ala Gly Phe Asp Asp Gly Val lie Val Glu 
595 600 605 
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Lys Gly Asn His Asp Glu Leu Met Lys Glu Lys Gly lie Tyr Phe Lys 
610 615 620 



Leu Val Thr Met Gin Thr Ala Gly Asn Glu Val Glu Leu Glu Asn Ala 
625 630 635 640 

Ala Asp Glu Ser Lys Ser Glu lie Asp Ala Leu Glu Met Ser Ser Asn 
645 650 655 

Asp Ser Arg Ser Ser Leu lie Arg Lys Arg Ser Thr Arg Arg Ser Val 
660 665 ' 670 

Arg Gly Ser Gin Ala Gin Asp Arg Lys Leu Ser Thr Lys Glu Ala Leu 
675 680 685 

Asp Glu Ser lie Pro Pro Val Ser Phe Trp Arg lie Met Lys Leu Asn 
690 695 700 

Leu Thr Glu Trp Pro Tyr Phe Val Val Gly Val Phe Cys Ala lie He 
705 710 715 720 

Asn Gly Gly Leu Gin Pro Ala Phe Ala He He Phe Ser Lys lie He 
725 730 735 

Gly Val Phe Thr Arg He Asp Asp Pro Glu Thr Lys Arg Gin Asn Ser 
740 745 750 

Asn Leu Phe Ser Leu Leu Phe Leu Ala Leu Gly He He Ser Phe He 
755 760 765 

Thr Phe Phe Leu Gin Gly Phe Thr Phe Gly Lys Ala Gly Glu He Leu 
770 775 780 

Thr Lys Arg Leu Arg Tyr Met Val Phe Arg Ser Met Leu Arg Gin Asp 
785 790 795 800 

Val Ser Trp Phe Asp Asp "Pro Lys Asn Thr Thr Gly Ala Leu Thr Thr 
805 810 815 

Arg Leu Ala Asn Asp Ala Ala Gin Val Lys Gly Ala He Gly Ser Arg 
820 825 830 

Leu Ala Val He Thr Gin Asn He Ala Asn Leu Gly Thr Gly He He 
835 840 845 

He Ser Phe He Tyr Gly Trp Gin Leu Thr Leu Leu Leu Leu Ala He 
850 855 860 

Val Pro He He Ala He Ala Gly Val Val Glu Met Lys Met Leu Ser 
865 870 875 880 

Gly Gin Ala Leu Lys Asp Lys Lys Glu Leu Glu Gly Ala Gly Lys He 
885 890 895 



Ala Thr Glu Ala He Glu Asn Phe 
900 

Glu Gin Lys Phe Glu His Met Tyr 

915 920 

Arg Asn Ser Leu Arg Lys Ala His 



Arg Thr Val Val Ser Leu Thr Gin 
905 910 

Ala Gin Ser Leu Gin Val Pro Tyr 
925 

He Phe Gly He Thr Phe Ser Phe 
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930 



935 



940 



Thr Gin Ala Met Met Tyr Phe Ser 
945 950 

Ala Tyr Leu Val Ala His Lys Leu 
965 

Val Phe Ser Ala Val Val Phe Gly 
980 

Ser Phe Ala Pro Asp Tyr Ala Lys 
995 1000 

He Met He He Glu Lys Thr Pro 
1010 1015 

Gly Leu Met Pro Asn Thr Leu Glu 
1025 1030 

Val Phe Asn Tyr Pro Thr Arg Pro 
1045 



Tyr Ala Gly Cys Phe Arg Phe Gly 
955 960 

Met Ser Phe Glu Asp Val Leu Leu 
970 975 

Ala Met Ala Val Gly Gin Val Ser 
985 990 

Ala Lys He Ser Ala Ala His He 
1005 

Leu He Asp Ser Tyr Ser Thr Glu 
1020 

Gly Asn Val Thr Phe Gly Glu Val 
1035 1040 

Asp He Pro Val Leu Gin Gly Leu 
1050 1055 



Ser Leu Glu Val Lys Lys Gly Gin Thr Leu Ala Leu Val Gly Ser Ser 
1060 1065 1070 

Gly Cys Gly Lys Ser Thr Val Val Gin Leu Leu Glu Arg Phe Tyr Asp 
1075 1080 1085 

Pro Leu Ala Gly Lys Val Leu Leu Asp Gly Lys Glu He Lys Arg Leu 
1090 1095 1100 



Asn Val Gin Trp Leu Arg Ala His Leu Gly He Val Ser Gin Glu Pro 

1105 1110 1115 1120 

He Leu Phe Asp Cys Ser He Ala Glu Asn He Ala Tyr Gly Asp Asn 

1125 1130 1135 



Ser Arg Val Val Ser Gin Glu Glu He Val • Arg Ala Ala Lys Glu Ala 
1140 1145 1150 

Asn He His Ala Phe He Glu Ser Leu Pro Asn Lys Tyr Ser Thr Lys 
1155 1160 1165 

Val Gly Asp Lys Gly Thr Gin Leu Ser Gly Gly Gin Lys Gin Arg He 
1170 1175 1180 



Ala He Ala Arg Ala Leu Val Arg Gin Pro His He Leu Leu Leu Asp 
1185 1190 1195 1200 

Glu Ala Thr Ser Ala Leu Asp Thr Glu Ser Glu Lys Val Val Gin Glu 
1205 1210 1215 

Ala Leu Asp Lys Ala Arg Glu Gly Arg Thr Cys He Val He Ala His 
1220 1225 1230 

Arg Leu Ser Thr He Gin Asn Ala Asp Leu He Val Val Phe Gin Asn 
1235 1240 1245 

Gly Arg Val Lys Glu His Gly Thr His Gin Gin Leu Leu Ala Gin Lys 
1250 1255 1260 
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Gly lie Tyr Phe Ser Met Val 
1265 1270 



Ser Val Gin Ala Gly Thr lie 
1275 



<210> 67 
<211> 579 
<212> PRT 

<213> Homo sapiens 
<400> 67 

Met Asp Leu Glu Gly Asp Arg Asn Gly Gly Ala Lys Lys Lys Asn Phe 
15 10 15 



Phe Lys Leu Asn Asn Lys Ser Glu Lys Asp Lys Lys Glu Lys Lys Pro 
20 25 30 

Thr Val Ser Val Phe Ser Met Phe Arg Tyr Ser Asn Trp Leu Asp Lys 
35 40 45 

Leu Tyr Met Val Val Gly Thr Leu Ala Ala lie lie His Gly Ala Gly 
50 55 60 

Leu Pro Leu Met Met Leu Val Phe Gly Glu Met Thr Asp lie Phe Ala 
65 70 75 80 

Asn Ala Gly Asn Leu Glu Asp Leu Met Ser Asn lie Thr Asn Arg Ser 
85 90 95 



Asp lie Asn Asp 
100 

Arg Tyr Ala Tyr 
115 

Ala Tyr lie Gin 
130 

His Lys lie Arg 
145 

Gly Trp Phe Asp 



Asp Asp Val Ser 
180 



Thr Gly Phe Phe 



Tyr Tyr Ser Gly 
120 

Val Ser Phe Trp 
135 

Lys Gin Phe Phe 
150 

Val His Asp Val 
165 

Lys lie Asn Glu 



Met Asn Leu Glu 
105 

lie Gly Ala Gly 



Cys Leu Ala Ala 
140 

His Ala lie Met 
155 

Gly Glu Leu Asn 
170 

Gly lie Gly Asp 
185 



Glu Asp Met Thr 
110 

Val Leu Val Ala 
125 

Gly Arg Gin lie 



Arg Gin Glu lie 
160 

Thr Arg Leu Thr 
175 

Lys lie Gly Met 
190 



Phe Phe Gin Ser Met Ala Thr Phe Phe Thr Gly Phe lie Val Gly Phe 

195 200 205 

Thr Arg Gly Trp Lys Leu Thr Leu Val lie Leu Ala lie Ser Pro Val 

210 215 220 

Leu Gly Leu Ser Ala Ala Val Trp Ala Lys lie Leu Ser Ser Phe Thr 

225 230 235 240 

Asp Lys Glu Leu Leu Ala Tyr Ala Lys Ala Gly Ala Val Ala Glu Glu 

245 250 255 

Val Leu Ala Ala He Arg Thr Val He Ala Phe Gly Gly Gin Lys Lys 
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260 



265 



270 



Glu Leu Glu Arg 
275 

lie Lys Lys Ala 
290 

Leu lie Tyr Ala 
305 

Val Leu Ser Gly 



Ser Val Leu lie 

3 4-0 

Glu Ala Phe Ala 
355 

lie Asp Asn Lys 
370 

Pro Asp Asn lie 
385 



Tyr Asn Lys Asn 
280 

lie Thr Ala Asn 
295 

Ser Tyr Ala Leu 
310 

Glu Tyr Ser lie 
325 

Gly Ala Phe Ser 



Asn Ala Arg Gly 
360 

Pro Ser lie Asp 
375 

Lys Gly Asn Leu 
390 



Leu Glu Glii Ala 



He Ser He Gly 
300 

Ala Phe Trp Tyr 
315 

Gly Gin Val Leu 
330 

Val Gly Gin Ala 
345 

Ala Ala Tyr Glu 



Ser Tyr Ser Lys 
380 

Glu Phe Arg Asn 
395 



Lys Arg He Gly 
285 

Ala Ala Phe Leu 



Gly Thr Thr Leu 
320 

Thr Val Phe Phe 
335 

Ser Pro Ser He 
350 

He Phe Lys He 
365 

Ser Gly His Lys 



Val His Phe Ser 
400 



Tyr Pro Ser Arg Lys Glu Val Lys He Leu Lys Gly Leu Asn Leu Lys 
405 410 415 

Val Gin Ser Gly Gin Thr Val Ala Leu Val Gly Asn Ser Gly Cys Gly 
420 425 430 

Lys Ser Thr Thr Val Gin Leu Met Gin Arg Leu Tyr Asp Pro Thr Glu 
435 440 445 

Gly Met Val Ser Val Asp Gly Gin Asp He Arg Thr He Asn Val Arg 
450 455 460 

Phe Leu Arg Glu He He Gly Val Val Ser Gin Glu Pro Val Leu Phe 
465 470 475 480 

Ala Thr Thr He Ala Glu Asn He Arg Tyr Gly Arg Glu Asn Val Thr 
485 490 495 

Met Asp Glu He. Glu Lys Ala Val Lys Glu Ala Asn Ala Tyr Asp Phe 
500 505 510 

He Met Lys Leu Pro His Lys Phe Asp Thr Leu Val Gly Glu Arg Gly 
515 520 525 

Ala Gin Leu Ser Gly Gly Gin Lys Gin Arg He Ala He Ala Arg Ala 
530 535 540 

Leu Val Arg Asn Pro Lys He Leu Leu Leu Asp Glu Ala Thr Ser Ala 
545 550 555 560 

Leu Asp Thr Glu Ser Glu Ala Glu Val Gin Ala Ala Leu Asp Lys Val 
565 570 575 



Ser Arg Leu 
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<210> 68 
<211> 218 
<212> PRT 

<213> Homo sapiens 
<400> 68 

Met Ser Arg Ser Lys Arg Asp Asn Asn Phe Tyr Ser Val Glu lie Gly 
1 5 '10 15 

Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr Gin Asn Leu Lys Pro lie 
20 25 30 

Gly Ser Gly Ala Gin Gly lie Val Cys Ala Ala Tyr Asp Ala lie Leu 
35 40 45 

Glu Arg Asn Val Ala lie Lys Lys Leu Ser Arg Pro Phe Gin Asn Gin 
50 55 60 

Thr His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys Cys Val 
65 70 75 80 

Asn His Lys Asn lie lie Gly Leu Leu Asn Val Phe Thr Pro Gin Lys 
85 90 95 

Ser Leu Glu Glu Phe Gin Asp Val Tyr lie Val Met Glu Leu Met Asp 
100 105 110 

Ala Asn Leu Cys Gin Val lie Gin Met Glu Leu Asp His Glu Arg Met 
115 120 125 

Ser Tyr Leu Leu Tyr Gin Met Leu Cys Gly lie Lys His Leu His Ser 
130 135 140 

Ala Gly lie lie His Arg Asp Leu Lys Pro Ser Asn lie Val Val Lys 
145 150 155 160 

Ser Asp Cys Thr Leu Lys lie Leu Asp Phe Gly Leu Ala Arg Thr Ala 
165 170 175 

Gly Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr Tyr Arg 
ISO 185 190 

Ala Pro Glu Val lie Leu Gly Met Gly Tyr Lys Glu Asn Gly Gly Arg 
195 200 205 

Met Gly Lys Gly lie Phe Thr Arg Leu Gin 
210 215 



<210> 69 
<211> 307 
<212> PRT 

<213> Homo sapiens 
<400> 69 

Met Ser Arg Ser Lys Arg Asp Asn Asn Phe Tyr Ser Val Glu He Gly 
15 10 15 
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Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr Gin Asn Leu Lys Pro He 
20 25 30 

Gly Ser Gly Ala Gin Gly He Val Cys Ala Ala Tyr Asp Ala He Leu 
35 40 45 

Glu Arg Asn Val Ala He Lys Lys Leu Ser Arg Pro f*he Gin Asn Gin 
50 55 60 

Thr His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys Cys Val 
65 70 75 80 

Asn His Lys Asn He He Gly Leu Leu Asn Val Phe Thr Pro Gin Lys 
85 90 95 

Ser Leu Glu Glu Phe Gin Asp Val Tyr He Val Met Glu Leu Met Asp 
100 105 110 

Ala Asn Leu Cys Gin Val He Gin Met Glu Leu Asp His Glu Arg Met 
115 120 125 

Ser Tyr Leu Leu Tyr Gin Met Leu Cys Gly He Lys His Leu His Ser 
130 135 140 

Ala Gly He He His Arg Asp Leu Lys Pro Ser Asn He Val Val Lys 
145 150 155 160 

Ser Asp Cys Thr Leu Lys He Leu Asp Phe Gly Leu Ala Arg Thr Ala 
165 170 175 

Gly Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr Tyr Arg 
ISO 185 190 

Ala Pro Glu Val He Leu Gly Met Gly Tyr Lys Glu Asn Val Asp Leu 
195 200 205 

Trp Ser Val Gly Cys He Met Gly Glu Met Val Cys His Lys He Leu 
210 215 220 

Phe Pro Gly Arg Asp Tyr He Asp Gin Trp Asn Lys Val He Glu Gin 
225 230 235 240 

Leu Gly Thr Pro Cys Pro Glu Phe Met Lys Lys Leu Gin Pro Thr Val 
245 250 255 

Arg Thr Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Tyr Ser Phe Glu 
260 265 270 

Lys Leu Phe Pro Asp Val Leu Phe Pro Ala Asp Ser Glu His Asn Lys 
275 280 285 

Leu Lys Ala Ser Gin Tyr Phe Leu Gin He Cys Thr Phe Asn Pro He 
290 295 300 

Trp Gly Val 
305 



<210> 70 
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<211> 339 
<212> PRT 

<213> Homo sapiens 
<400> 70 

Met Ser Arg Ser Lys Arg Asp Asn Asn Phe Tyr Ser Val Glu He Gly 
15 10 15 

Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr Gin Asn Leu Lys Pro He 
20 . 25 30 

Gly Ser Gly Ala Gin Gly He Val Cys Ala Ala Tyr Asp Ala He Leu 
35 40 45 

Glu Arg Asn Val Ala He Lys Lys Leu Ser Arg Pro Phe Gin Asn Gin 
50 55 60 

Thr His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys Cys Val 
65 70 75 80 

Asn His Lys Asn He He Gly Leu Leu Asn Val Phe Thr Pro Gin Lys 
85 . 90 95 

Ser Leu Glu Glu Phe Gin Asp Val Tyr He Val Met Glu Leu Met Asp 
100 105 110 

Ala Asn Leu Cys Gin Val He Gin Met Glu Leu Asp His Glu Arg Met 
115 120 125 

Ser Tyr Leu Leu Tyr Gin Met Leu Cys Gly He Lys His Leu His Ser 
130 135 140 

Ala Gly He He His Arg Asp Leu Lys Pro Ser Asn He Val Val Lys 
145 150 155 160 

Ser Asp Cys Thr Leu Lys He Leu Asp Phe Gly Leu Ala Arg Thr Ala 
165 170 175 

Gly Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr Tyr Arg 
180 185 190 

Ala Pro Glu Val He Leu Gly Met Gly Tyr Lys Glu Asn Val Asp Leu 
195 200 205 

Trp Ser Val Gly Cys He Met Gly Glu Met Val Cys His Lys He Leu 
210 215 220 

Phe Pro Gly Arg Asp Tyr He Asp Gin Trp Asn Lys Val He Glu Gin 
225 230 235 240 

Leu Gly Thr Pro Cys Pro Glu Phe Met Lys Lys Leu Gin Pro Thr Val 
245 250 255 

Arg Thr Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Tyr Ser Phe Glu 
260 265 270 

Lys Leu Phe Pro Asp Val Leu Phe Pro Ala Asp Ser Glu His Asn Lys 
275 280 285 

Leu Lys Ala Ser Gin Ala Arg Asp Leu Leu Ser Lys Met Leu Val He 
290 295 300 
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Asp Ala Ser Lys Arg lie Ser Val Asp Glu Ala Leu Gin His Pro Tyr 

305 310 315 320 

lie Asn Val Trp Tyr Asp Pro Ser Glu Ala Glu Ala Arg Ser Cys Lys 

325 330 335 



Leu Phe Ser 



<210> 71 
<211> 178 
<212> PRT 

<213> Homo sapiens 



<400> 71 
Ala Arg Ser Gly 
1 

Val Arg Ala Val 
20 

Gly Ala Val Cys 

35 



Phe Tyr Arg Gin 
5 

Tyr Arg Asp Leu 

Ser Ala Val Asp 
40 



Glu Val Thr Lys 
10 

Gin Pro Val Gly 
25 

Gly Arg Thr Gly 



Thr Ala Trp Glu 
15 

Ser Gly Ala Tyr 
30 

Ala Lys Val Ala 
45 



lie Lys Lys Leu 
50 

Ala Tyr Arg Glu 
65 

lie Gly Leu Leu 



Thr Asp Phe Tyr 
100 

Leu Met Lys His 
115 

Tyr Gin Met Leu 
130 

His Arg Val Ser 
145 

Ala lie Pro Pro 



Tyr Arg Pro Phe 
55 

Leu Arg Leu Leu 
70 

Asp Val Phe Thr 
85 

Leu Val Met Pro 



Glu Lys Leu Gly 
120 

Lys Gly Leu Arg 
135 

Pro Gly Gly Glu 
150 

Pro Pro Arg Pro 
165 



Gin Ser Glu Leu 
60 

Lys His Met Arg 
75 

Pro Asp Glu Thr 
90 

Phe Met Gly Thr 
105 

Glu Asp Arg lie 



Tyr lie His Ala 
140 

Ala Ala His Gin 
155 

Thr Cys Glu Asp 
170 



Phe Ala Lys Arg 



His Glu Asn Val 
80 

Leu Asp Asp Phe 
95 

Asp Leu Gly Lys 
110 

Gin Phe Leu Val 
125 

Ala Gly lie lie 



Pro Ser Pro Ser 
160 

Val Met Gly Ser 
175 



Gly Cys 



<210> 72 
<211> 648 
<212> PRT 

<213> Homo sapiens 
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<400> 72 
Met Ser Pro Phe 
1 

Cys Gin Ser Cys 
20 

Val Lys Glu Tyr 
35 

Lys Pro Thr Met 
50 

Asn Lys Gly Arg 
65 

Leu lie Ser Glu 



Arg Lys Asn Asn 
100 

Gly Arg Met Leu 
115 



Lys Asp Met Asn 
130 

Arg Arg Gly Ala 
145 

Glu Phe Thr Ala 



His Glu Phe Val 
180 

Cys Asn Ala Ala 
195 

Cys Thr Gly Ser 
210 

Arg Phe Lys lie 
225 

Ser Pro Thr Phe 



Arg Gin Gly Leu 
260 

Cys Gin Thr Lys 
275 

Ala Glu Ala Leu 
290 

Arg Asp Thr Glu 
305 



Leu Arg lie Gly 

• 5 

Gin Gly Glu Ala 



Val Glu Ser Glu 
4 0 

Tyr Pro Pro Trp 
55 

Val Met Gin lie 
70 

Thr Thr Val Glu 
85 

Gly Lys Thr Glu 



Met Asn Ala Arg 
120 

Glu Phe Glu Thr 
135 

lie Lys Gin Ala 
150 

Thr Phe Phe Pro 
165 

Trp Gly Leu Asn 



lie His Lys Lys 
200 

Ala lie Asn Ser 
215 

Asp Met Pro His 
230 

Cys Glu His Cys 
245 

Lys Cys Asp Ala 



Val Ala Asn Leu 
280 

Ala Met lie Glu 
295 

Gin lie Phe Arg 
310 



Leu Ser Asn Phe 
10 

Val Asn Pro Tyr 
25 

Asn Gly Gin Met 



Asp Ser Thr Phe 
60 

lie Val Lys Gly 
75 

Leu Tyr Ser Leu 
90 

lie Trp Leu Glu 
105 

Tyr Phe Leu Glu 



Glu Gly Phe Phe 
140 

Lys Val His His 
155 

Gin Pro Thr Phe 
170 

Lys Gin Gly Tyr 
185 

Cys lie Asp Lys 



Arg Glu Thr Met 
220 

Arg Phe Lys Val 
235 

Gly Thr Leu Leu 
250 

Cys Gly Met Asn 
265 

Cys Gly lie Asn 



Ser Thr Gin Gin 
300 

Glu Gly Pro Val 
315 



Asp Cys Gly Ser 
15 

Cys Ala Val Leu 
30 

Tyr lie Gin Lys 
45 

Asp Ala His lie 



Lys Asn Val Asp 
80 

Ala Glu Arg Cys 
95 

Leu Lys Pro Gin 
110 

Met Ser Asp Thr 
125 

Ala Leu His Gin 



Val Lys Cys His 
160 

Cys Ser Val Cys 
175 

Gin Cys Arg Gin 
190 

Val lie Ala Lys 
205 

Phe His Lys Glu 



Tyr Asn Tyr Lys 
240 

Trp Gly Leu Ala 
255 

Val His His Arg 
270 

Gin Lys Leu Met 
285 

Ala Arg Cys Leu 



Glu lie Gly Leu 
320 
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Pro Cys 



Ser lie 



Lys Asn 
325 



Glu Ala Arg Pro Pro Cys Leu Pro Thr Pro 
330 335 



Giy Lys Arg Glu Pro Gin Gly lie Ser Trp Glu Ser Pro Leu Asp Glu 
. 340 345 350 

Val Asp Lys Met Cys His Leu Pro Glu Pro Glu Leu Asn Lys Glu Arg 
355" 360 365 

Pro Ser Leu Gin lie Lys Leu Lys lie Glu Asp Phe lie Leu His Lys 
370 375 ' 380 

Met Leu Gly Lys Gly Ser Phe Gly Lys Val Phe Leu Ala Glu Phe Lys 
385 390 395 400 

Lys Thr Asn Gin Phe Phe Ala lie Lys Ala Leu Lys Lys Asp Val Val 
405 410 415 

Leu Met Asp Asp Asp Val Glu Cys Thr Met Val Glu Lys Arg Val Leu 
420 425 430 

Ser Leu Ala Trp Glu His Pro Phe Leu Thr His Met Phe Cys Thr Phe 
435 440 445 

Gin Thr Lys Glu Asn Leu Phe Phe Val Met Glu Tyr Leu Asn Gly Gly 
450 455 460 

Asp Leu Met Tyr His lie Gin Ser Cys His Lys Phe Asp Leu Ser Arg 
465 470 475 480 

Ala Thr Phe Tyr Ala Ala Glu lie lie Leu Gly Leu Gin Phe Leu His 
485 490 495 

Ser Lys Gly lie Val Tyr Arg Asp Leu Lys Leu Asp Asn lie Leu Leu 
500 505 510 

Asp Lys Asp Gly His lie Lys lie Ala Asp Phe Gly Met Cys Lys Glu 
515 520 525 

Asn Met Leu Gly Asp Ala Lys Thr Asn Thr Phe Cys Gly Thr Pro Asp 
530 535 540 

Tyr lie Ala Pro Glu lie Leu Leu Gly Gin Lys Tyr Asn His Ser Val 
545 550 555 560 

Asp Trp Trp Ser Phe Gly Val Leu Leu Tyr Glu Met Leu lie Gly Gin 
565 570 575 

Ser Pro Phe His Gly Gin Asp Glu Glu Glu Leu Phe His Ser lie Arg 
580 585 590 

Met Asp Asn Pro Phe Tyr Pro Arg Trp Leu Glu Lys Glu Ala Lys Asp 
595 600 605 

Leu Leu Val Lys Val Arg Ser Glu Ala Lys Ser Val Phe lie Arg Arg 
610 615 620 

Ala Leu Gly Leu Leu Val Ser Phe Leu Phe Leu Leu Val Ser Asn Leu 
625 630 635 640 

His Val Ala Asn Asn Asp Tyr Tyr 
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